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COMPOSITIONS AND METHODS FOR THE DIAGNOSIS AND TREATMENT OF TUMOR 

FIELD OP THE INVENTION 
The present invention is directed to compositions of matter useful for the diagnosis and treatment of 
tumor in mammals and to methods of using those compositions of matter for the same. 

5 

BACKGROUND OF THE INVENTION 
Malignant tumors (cancers) are the second leading cause of death in the United States, after heart 
disease (Boring et al., CA Cancel J. Clin. 43:7 (1993)). Cancer is characterized by the increase in the number 
of abnormal, or neoplastic, cells derived from a normal tissue which proliferate to form a tumor mass, the 
10 invasion of adjacent tissues by these neoplastic tumor cells, and the generation of malignant cells which 

eventually spread via the blood or lymphatic system to regional lymph nodes and to distant sites via a process 
called metastasis. In a cancerous state, a cell proliferates under conditions in which normal cells would not 
grow. Cancer manifests itself in a wide variety of forms, characterized by different degrees of invasiveness 
and aggressiveness. 

15 In attempts to discover effective cellular targets for cancer diagnosis and therapy, researchers have 

sought to identify transmembrane or otherwise membrane-associated polypeptides that are specifically expressed 
on the surface of one or more particular type(s) of cancer cell as compared to on one or more normal non- 
cancerous cell(s). Often, such membrane-associated polypeptides are more abundantly expressed on the surface 
of the cancer cells as compared to on the surface of the non-cancerous cells. The identification of such tumor- 

20 associated cell surface antigen polypeptides has given rise to the ability to specifically target cancer cells for 
destruction via antibody-based therapies. In this regard, it is noted that antibody-based therapy has proved very 
effective in the treatment of certain cancers. For example, HERCEPTIN® and RITUXAN® (both from 
Genentech Inc. , South San Francisco, California) are antibodies that have been used successfully to treat breast 
cancer and non-Hodgkin's lymphoma, respectively. More specifically, HERCEPTIN® is a recombinant 

25 DNA-derived humanized monoclonal antibody that selectively binds to the extracellular domain of the human 
epidermal growth factor receptor 2 (HER2) proto-oncogene. HER2 protein overexpression is observed in 
25-30% of primary breast cancers. RITUXAN® is a genetically engineered chimeric murine/human 
monoclonal antibody directed against the CD20 antigen found on the surface of normal and malignant B 
lymphocytes. Both these antibodies are recombinantly produced in CHO cells. 

30 In other attempts to discover effective cellular targets for cancer diagnosis and therapy, researchers 

have sought to identify (1) non-membrane-associated polypeptides that are specifically produced by one or more 
particular type(s) of cancer cell(s) as compared to by one or more particular type(s) of non-cancerous normal 
cell(s), (2) polypeptides that are produced by cancer cells at an expression level that is significantly higher than 
that of one or more normal non-cancerous cell(s), or (3) polypeptides whose expression is specifically limited 
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to only a single (or very limited number of different) tissue type(s) in both the cancerous and non-cancerous state 
(e.g., normal prostate and prostate tumor tissue). Such polypeptides may remain intracellularly located or may 
be secreted by the cancer cell. Moreover, such polypeptides may be expressed not by the cancer cell itself, but 
rather by cells which produce and/or secrete polypeptides having a potentiating or growth-enhancing effect on 
cancer cells. Such secreted polypeptides are often proteins that provide cancer cells with a growth advantage 
5 over normal cells and include such things as, for example, angiogenic factors, cellular adhesion factors, growth 
factors, and the like. Identification of antagonists of such non-membrane associated polypeptides would be 
expected to serve as effective therapeutic agents for the treatment of such cancers. Furthermore, identification 
of the expression pattern of such polypeptides would be useful for the diagnosis of particular cancers in 
mammals. 

1 0 Despite the above identified advances in mammalian cancer therapy, there is a great need for additional 

diagnostic and therapeutic agents capable of detecting the presence of tumor in a mammal and for effectively 
inhibiting neoplastic cell growth, respectively. Accordingly, it is an objective of the present invention to 
identify: (1) cell membrane-associated polypeptides that are more abundantly expressed on one or more type(s) 
of cancer cell(s) as compared to on normal cells or on other different cancer cells, (2) non-membrane-associated 

1 5 polypeptides that are specifically produced by one or more particular type(s) of cancer cell(s) (or by other cells 
that produce polypeptides having a potentiating effect on the growth of cancer cells) as compared to by one or 
more particular type(s) of non-cancerous normal cell(s), (3) non-membrane-associated polypeptides that are 
produced by cancer cells at an expression level that is significantly higher than that of one or more normal non- 
cancerous cell(s), or (4) polypeptides whose expression is specifically limited to only a single (or very limited 

20 number of different) tissue type(s) in both a cancerous and non-cancerous state (e.g., normal prostate and 
prostate tumor tissue), and to use those polypeptides, and their encoding nucleic acids, to produce compositions 
of matter useful in the therapeutic treatment and diagnostic detection of cancer in mammals. It is also an 
objective of the present invention to identify cell membrane-associated, secreted or intracellular polypeptides 
whose expression is limited to a single or very limited number of tissues, and to use those polypeptides, and 

25 their encoding nucleic acids, to produce compositions of matter useful in the therapeutic treatment and diagnostic 
detection of cancer in mammals. 

SUMMARY OF THE INVENTION 

A. Embodiments 

30 In the present specification, Applicants describe for the first time the identification of various cellular 

polypeptides (and their encoding nucleic acids or fragments thereof) which are expressed to a greater degree 
on the surface of or by one or more types of cancer cell(s) as compared to on the surface of or by one or more 
types of normal non-cancer cells. Alternatively, such polypeptides are expressed by cells which produce and/or 
secrete polypeptides having a potentiating or growth-enhancing effect on cancer cells . Again alternatively, such 

35 polypeptides may not be overexpressed by tumor cells as compared to normal cells of the same tissue type, but 
rather may be specifically expressed by both tumor cells and normal cells of only a single or very limited 
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number of tissue types (preferably tissues which are not essential for life, e.g. , prostate, etc.). All of the above 
polypeptides are herein referred to as Tumor-associated Antigenic Target polypeptides ( a TAT w polypeptides) 
and are expected to serve as effective targets for cancer therapy and diagnosis in mammals. 

Accordingly, in one embodiment of the present invention, the invention provides an isolated nucleic 

5 acid molecule having a nucleotide sequence that encodes a tumor-associated antigenic target polypeptide or 
fragment thereof (a "TAT" polypeptide). 

In certain aspects, the isolated nucleic acid molecule comprises a nucleotide sequence having at least 
about 80% nucleic acid sequence identity, alternatively at least about 81%, 82%, 83%, 84%, 85%, 86%, 87%, 
88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% nucleic acid sequence identity, 

10 to (a) a DNA molecule encoding a full-length TAT polypeptide having an amino acid sequence as disclosed 
herein, a TAT polypeptide amino acid sequence lacking the signal peptide as disclosed herein, an extracellular 
domain of a transmembrane TAT polypeptide, with or without the signal peptide, as disclosed herein or any 
other specifically defined fragment of a full-length TAT polypeptide amino acid sequence as disclosed herein, 
or (b) the complement of the DNA molecule of (a). 

15 In other aspects, the isolated nucleic acid molecule comprises a nucleotide sequence having at least 

about 80% nucleic acid sequence identity, alternatively at least about 81%, 82%, 83%, 84%, 85%, 86%, 87%, 
88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% nucleic acid sequence identity, 
to (a) a DNA molecule comprising the coding sequence of a full-length TAT polypeptide cDNA as disclosed 
herein, the coding sequence of a TAT polypeptide lacking the signal peptide as disclosed herein, the coding 

20 sequence of an extracellular domain of a transmembrane TAT polypeptide, with or without the signal peptide, 
as disclosed herein or the coding sequence of any other specifically defined fragment of the full-length TAT 
polypeptide amino acid sequence as disclosed herein, or (b) the complement of the DNA molecule of (a). 

In further aspects, the invention concerns an isolated nucleic acid molecule comprising a nucleotide 
sequence having at least about 80 % nucleic acid sequence identity, alternatively at least about 8 1 % , 82 % , 83 % , 

25 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% 
nucleic acid sequence identity, to (a) a DNA molecule that encodes the same mature polypeptide encoded by 
the full-length coding region of any of the human protein cDNAs deposited with the ATCC as disclosed herein, 
or (b) the complement of the DNA molecule of (a). 

Another aspect of the invention provides an isolated nucleic acid molecule comprising a nucleotide 

30 sequence encoding a TAT polypeptide which is either transmembrane domain-deleted or transmembrane domain- 
inactivated, or is complementary to such encoding nucleotide sequence, wherein the transmembrane domain(s) 
of such polypeptide(s) are disclosed herein. Therefore, soluble extracellular domains of the herein described 
TAT polypeptides are contemplated. 

In other aspects, the present invention is directed to isolated nucleic acid molecules which hybridize 

35 to (a) a nucleotide sequence encoding a TAT polypeptide having a full-length amino acid sequence as disclosed 
herein, a TAT polypeptide amino acid sequence lacking the signal peptide as disclosed herein, an extracellular 
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domain of a transmembrane TAT polypeptide, with or without the signal peptide, as disclosed herein or any 
other specifically defined fragment of a full-length TAT polypeptide amino acid sequence as disclosed herein, 
or (b) the complement of the nucleotide sequence of (a). In this regard, an embodiment of the present invention 
is directed to fragments of a full-length TAT polypeptide coding sequence, or the complement thereof, as 
disclosed herein, that may find use as, for example, hybridization probes useful as, for example, diagnostic 
5 probes, antisense oligonucleotide probes, or for encoding fragments of a full-length TAT polypeptide that may 
optionally encode a polypeptide comprising a binding site for an anti-TAT polypeptide antibody, a TAT binding 
oligopeptide or other small organic molecule that binds to a TAT polypeptide. Such nucleic acid fragments are 
usually at least about 5 nucleotides in length, alternatively at least about 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 
17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 

10 105, 110, 115, 120, 125, 130, 135, 140, 145, 150, 155, 160, 165, 170, 175, 180, 185, 190, 195, 200, 210, 
220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 
430, 440, 450, 460, 470, 480, 490, 500, 510, 520, 530, 540, 550, 560, 570, 580, 590, 600, 610, 620, 630, 
640, 650, 660, 670, 680, 690, 700, 710, 720, 730, 740, 750, 760, 770, 780, 790, 800, 810, 820, 830, 840, 
850, 860, 870, 880, 890, 900, 910, 920, 930, 940, 950, 960, 970, 980, 990, or 1000 nucleotides in length, 

15 wherein in this context the term "about" means the referenced nucleotide sequence length plus or minus 10% 
of that referenced length. It is noted that novel fragments of a TAT polypeptide-encoding nucleotide sequence 
may be determined in a routine manner by aligning the TAT polypeptide-encoding nucleotide sequence with 
other known nucleotide sequences using any of a number of well known sequence alignment programs and 
determining which TAT polypeptide-encoding nucleotide sequence fragment(s) are novel. All of such novel 

20 fragments of TAT polypeptide-encoding nucleotide sequences are contemplated herein. Also contemplated are 
the TAT polypeptide fragments encoded by these nucleotide molecule fragments, preferably those TAT 
polypeptide fragments that comprise a binding site for an anti-TAT antibody, a TAT binding oligopeptide or 
other small organic molecule that binds to a TAT polypeptide. 

In another embodiment, the invention provides isolated TAT polypeptides encoded by any of the 

25 isolated nucleic acid sequences hereinabove identified. 

In a certain aspect, the invention concerns an isolated TAT polypeptide, comprising an amino acid 
sequence having at least about 80 % amino acid sequence identity, alternatively at least about 8 1 % , 82 % , 83 % , 
84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% amino 
acid sequence identity, to a TAT polypeptide having a full-length amino acid sequence as disclosed herein, a 

30 TAT polypeptide amino acid sequence lacking the signal peptide as disclosed herein, an extracellular domain 
of a transmembrane TAT polypeptide protein, with or without the signal peptide, as disclosed herein, an amino 
acid sequence encoded by any of the nucleic acid sequences disclosed herein or any other specifically defined 
fragment of a full-length TAT polypeptide amino acid sequence as disclosed herein. 

In a further aspect, the invention concerns an isolated TAT polypeptide comprising an amino acid 

3 5 sequence having at least about 80 % amino acid sequence identity, alternatively at least about 8 1 % , 82 % , 83 % , 
84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% amino acid 
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sequence identity, to an amino acid sequence encoded by any of the human protein cDNAs deposited with the 
ATCC as disclosed herein. 

In a specific aspect, the invention provides an isolated TAT polypeptide without the N-terminal signal 
sequence and/or without the initiating methionine and is encoded by a nucleotide sequence that encodes such 
an amino acid sequence as hereinbefore described. Processes for producing the same are also herein described, 
5 wherein those processes comprise culturing a host cell comprising a vector which comprises the appropriate 
encoding nucleic acid molecule under conditions suitable for expression of the TAT polypeptide and recovering 
the TAT polypeptide from the cell culture. 

Another aspect of the invention provides an isolated TAT polypeptide which is either transmembrane 
domain-deleted or transmembrane domain-inactivated. Processes for producing the same are also herein 
10 described, wherein those processes comprise culturing a host cell comprising a vector which comprises the 
appropriate encoding nucleic acid molecule under conditions suitable for expression of the TAT polypeptide and 
recovering the TAT polypeptide from the cell culture. 

In other embodiments of the present invention, the invention provides vectors comprising DNA 
encoding any of the herein described polypeptides. Host cells comprising any such vector are also provided. 
15 By way of example, the host cells may be CHO cells, E. coli cells, or yeast cells. A process for producing any 
of the herein described polypeptides is further provided and comprises culturing host cells under conditions 
suitable for expression of the desired polypeptide and recovering the desired polypeptide from the cell culture. 

In other embodiments, the invention provides isolated chimeric polypeptides comprising any of the 
herein described TAT polypeptides fused to a heterologous (non-TAT) polypeptide. Example of such chimeric 
20 molecules comprise any of the herein described TAT polypeptides fused to a heterologous polypeptide such as, 
for example, an epitope tag sequence or a Fc region of an immunoglobulin. 

In another embodiment, the invention provides an antibody which binds, preferably specifically, to 
any of the above or below described polypeptides. Optionally, the antibody is a monoclonal antibody, antibody 
fragment, chimeric antibody, humanized antibody, single-chain antibody or antibody that competitively inhibits 
25 the binding of an anti-TAT polypeptide antibody to its respective antigenic epitope. Antibodies of the present 
invention may optionally be conjugated to a growth inhibitory agent or cytotoxic agent such as a toxin, 
including, for example, a maytansinoid or calicheamicin, an antibiotic, a radioactive isotope, a nucleolytic 
enzyme, or the like. The antibodies of the present invention may optionally be produced in CHO cells or 
bacterial cells and preferably induce death of a cell to which they bind. For diagnostic purposes, the antibodies 
30 of the present invention may be detectably labeled, attached to a solid support, or the like. 

In other embodiments of the present invention, the invention provides vectors comprising DNA 
encoding any of the herein described antibodies. Host cell comprising any such vector are also provided. By 
way of example, the host cells may be CHO cells, E. coli cells, or yeast cells. A process for producing any 
of the herein described antibodies is further provided and comprises culturing host cells under conditions suitable 
35 for expression of the desired antibody and recovering the desired antibody from the cell culture. 

In another embodiment, the invention provides oligopeptides ( W TAT binding oligopeptides") which 
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bind, preferably specifically, to any of the above or below described TAT polypeptides. Optionally, the TAT 
binding oligopeptides of the present invention may be conjugated to a growth inhibitory agent or cytotoxic agent 
such as a toxin, including, for example, a maytansinoid or calicheamicin, an antibiotic, a radioactive isotope, 
a nucleolytic enzyme, or the like. The TAT binding oligopeptides of the present invention may optionally be 
produced in CHO cells or bacterial cells and preferably induce death of a cell to which they bind. For 
5 diagnostic purposes, the TAT binding oligopeptides of the present invention may be detectably labeled, attached 
to a solid support, or the like. 

In other embodiments of the present invention, the invention provides vectors comprising DNA 
encoding any of the herein described TAT binding oligopeptides. Host cell comprising any such vector are also 
provided. By way of example, the host cells may be CHO cells, E. coli cells, or yeast cells. A process for 

10 producing any of the herein described TAT binding oligopeptides is further provided and comprises culturing 
host cells under conditions suitable for expression of the desired oligopeptide and recovering the desired 
oligopeptide from the cell culture. 

In another embodiment, the invention provides small organic molecules ("TAT binding organic 
molecules") which bind, preferably specifically, to any of the above or below described TAT polypeptides. 

15 Optionally, the TAT binding organic molecules of the present invention may be conjugated to a growth 

inhibitory agent or cytotoxic agent such as a toxin, including, for example, a maytansinoid or calicheamicin, 
an antibiotic, a radioactive isotope, a nucleolytic enzyme, or the like. The TAT binding organic molecules of 
the present invention preferably induce death of a cell to which they bind. For diagnostic purposes, the TAT 
binding organic molecules of the present invention may be detectably labeled, attached to a solid support, or 

20 the like. 

In a still further embodiment, the invention concerns a composition of matter comprising a TAT 
polypeptide as described herein, a chimeric TAT polypeptide as described herein, an anti-TAT antibody as 
described herein, a TAT binding oligopeptide as described herein, or a TAT binding organic molecule as 
described herein, in combination with a carrier. Optionally, the carrier is a pharmaceutical^ acceptable carrier. 

25 In yet another embodiment, the invention concerns an article of manufacture comprising a container 

and a composition of matter contained within the container, wherein the composition of matter may comprise 
a TAT polypeptide as described herein, a chimeric TAT polypeptide as described herein, an anti-TAT antibody 
as described herein, a TAT binding oligopeptide as described herein, or a TAT binding organic molecule as 
described herein. The article may further optionally comprise a label affixed to the container, or a package 

30 insert included with the container, that refers to the use of the composition of matter for the therapeutic 
treatment or diagnostic detection of a tumor. 

Another embodiment of the present invention is directed to the use of a TAT polypeptide as described 
herein, a chimeric TAT polypeptide as described herein, an anti-TAT polypeptide antibody as described herein, 
a TAT binding oligopeptide as described herein, or a TAT binding organic molecule as described herein, for 

35 the preparation of a medicament useful in the treatment of a condition which is responsive to the TAT 

polypeptide, chimeric TAT polypeptide, anti-TAT polypeptide antibody, TAT binding oligopeptide, or TAT 
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binding organic molecule. 

B. Additional Embodiments 

Another embodiment of the present invention is directed to a method for inhibiting the growth of a cell 
that expresses a TAT polypeptide, wherein the method comprises contacting the cell with an antibody, an 
oligopeptide or a small organic molecule that binds to the TAT polypeptide, and wherein the binding of the 
5 antibody, oligopeptide or organic molecule to the TAT polypeptide causes inhibition of the growth of the cell 
expressing the TAT polypeptide. In preferred embodiments, the cell is a cancer cell and binding of the 
antibody, oligopeptide or organic molecule to the TAT polypeptide causes death of the cell expressing the TAT 
polypeptide. Optionally, the antibody is a monoclonal antibody, antibody fragment, chimeric antibody, 
humanized antibody, or single-chain antibody. Antibodies, TAT binding oligopeptides and TAT binding organic 

1 0 molecules employed in the methods of the present invention may optionally be conjugated to a growth inhibitory 
agent or cytotoxic agent such as a toxin, including, for example, a maytansinoid or calicheamicin, an antibiotic, 
a radioactive isotope, a nucleolytic enzyme, or the like. The antibodies and TAT binding oligopeptides 
employed in the methods of the present invention may optionally be produced in CHO cells or bacterial cells. 
Yet another embodiment of the present invention is directed to a method of therapeutically treating a 

15 mammal having a cancerous tumor comprising cells that express a TAT polypeptide, wherein the method 
comprises administering to the mammal a therapeutically effective amount of an antibody, an oligopeptide or 
a small organic molecule that binds to the TAT polypeptide, thereby resulting in the effective therapeutic 
treatment of the tumor. Optionally, the antibody is a monoclonal antibody, antibody fragment, chimeric 
antibody, humanized antibody, or single-chain antibody. Antibodies, TAT binding oligopeptides and TAT 

20 binding organic molecules employed in the methods of the present invention may optionally be conjugated to 
a growth inhibitory agent or cytotoxic agent such as a toxin, including, for example, a maytansinoid or 
calicheamicin, an antibiotic, a radioactive isotope, a nucleolytic enzyme, or the like. The antibodies and 
oligopeptides employed in the methods of the present invention may optionally be produced in CHO cells or 
bacterial cells. 

25 Yet another embodiment of the present invention is directed to a method of determining the presence 

of a TAT polypeptide in a sample suspected of containing the TAT polypeptide, wherein the method comprises 
exposing the sample to an antibody, oligopeptide or small organic molecule that binds to the TAT polypeptide 
and determining binding of the antibody, oligopeptide or organic molecule to the TAT polypeptide in the sample, 
wherein the presence of such binding is indicative of the presence of the TAT polypeptide in the sample. 

30 Optionally, the sample may contain cells (which may be cancer cells) suspected of expressing the TAT 

polypeptide. The antibody, TAT binding oligopeptide or TAT binding organic molecule employed in the 
method may optionally be detectably labeled, attached to a solid support, or the like. 

A further embodiment of the present invention is directed to a method of diagnosing the presence of 
a tumor in a mammal, wherein the method comprises detecting the level of expression of a gene encoding a 

35 TAT polypeptide (a) in a test sample of tissue cells obtained from said mammal, and (b) in a control sample of 
known normal non-cancerous cells of the same tissue origin or type, wherein a higher level of expression of the 
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TAT polypeptide in the test sample, as compared to the control sample, is indicative of the presence of tumor 
in the mammal from which the test sample was obtained. 

Another embodiment of the present invention is directed to a method of diagnosing the presence of a 
tumor in a mammal, wherein the method comprises (a) contacting a test sample comprising tissue cells obtained 
from the mammal with an antibody, oligopeptide or small organic molecule that binds to a TAT polypeptide and 
5 (b) detecting the formation of a complex between the antibody, oligopeptide or small organic molecule and the 
TAT polypeptide in the test sample, wherein the formation of a complex is indicative of the presence of a tumor 
in the mammal. Optionally, the antibody, TAT binding oligopeptide or TAT binding organic molecule 
employed is detectably labeled, attached to a solid support, or the like, and/or the test sample of tissue cells is 
obtained from an individual suspected of having a cancerous tumor. 

1 0 Yet another embodiment of the present invention is directed to a method for treating or preventing a 

cell proliferative disorder associated with altered, preferably increased, expression or activity of a TAT 
polypeptide, the method comprising administering to a subject in need of such treatment an effective amount 
of an antagonist of a TAT polypeptide. Preferably, the cell proliferative disorder is cancer and the antagonist 
of the TAT polypeptide is an anti-TAT polypeptide antibody, TAT binding oligopeptide, TAT binding organic 

15 molecule or antisense oligonucleotide. Effective treatment or prevention of the cell proliferative disorder may 
be a result of direct killing or growth inhibition of cells that express a TAT polypeptide or by antagonizing the 
cell growth potentiating activity of a TAT polypeptide. 

Yet another embodiment of the present invention is directed to a method of binding an antibody, 
oligopeptide or small organic molecule to a cell that expresses a TAT polypeptide, wherein the method 

20 comprises contacting a cell that expresses a TAT polypeptide with said antibody, oligopeptide or small organic 
molecule under conditions which are suitable for binding of the antibody, oligopeptide or small organic molecule 
to said TAT polypeptide and allowing binding therebetween. 

Other embodiments of the present invention are directed to the use of (a) a TAT polypeptide, (b) a 
nucleic acid encoding a TAT polypeptide or a vector or host cell comprising that nucleic acid, (c) an anti-TAT 

25 polypeptide antibody, (d) a TAT-binding oligopeptide, or (e) a TAT-binding small organic molecule in the 
preparation of a medicament useful for (i) the therapeutic treatment or diagnostic detection of a cancer or tumor, 
or (ii) the therapeutic treatment or prevention of a cell proliferative disorder. 

Another embodiment of the present invention is directed to a method for inhibiting the growth of a 
cancer cell, wherein the growth of said cancer cell is at least in part dependent upon the growth potentiating 

30 effect(s) of a TAT polypeptide (wherein the TAT polypeptide may be expressed either by the cancer cell itself 
or a cell that produces polypeptide(s) that have a growth potentiating effect on cancer cells) , wherein the method 
comprises contacting the TAT polypeptide with an antibody, an oligopeptide or a small organic molecule that 
binds to the TAT polypeptide, thereby antagonizing the growth-potentiating activity of the TAT polypeptide and, 
in turn, inhibiting the growth of the cancer cell. Preferably the growth of the cancer cell is completely inhibited. 

3 5 Even more preferably, binding of the antibody, oligopeptide or small organic molecule to the TAT polypeptide 
induces the death of the cancer cell. Optionally, the antibody is a monoclonal antibody, antibody fragment, 
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chimeric antibody, humanized antibody, or single-chain antibody. Antibodies, TAT binding oligopeptides and 
TAT binding organic molecules employed in the methods of the present invention may optionally be conjugated 
to a growth inhibitory agent or cytotoxic agent such as a toxin, including, for example, a maytansinoid or 
calicheamicin, an antibiotic, a radioactive isotope, a nucleolytic enzyme, or the like. The antibodies and TAT 
binding oligopeptides employed in the methods of the present invention may optionally be produced in CHO 
5 cells or bacterial cells. 

Yet another embodiment of the present invention is directed to a method of therapeutically treating a 
tumor in a mammal, wherein the growth of said tumor is at least in part dependent upon the growth potentiating 
effect(s) of a TAT polypeptide, wherein the method comprises administering to the mammal a therapeutically 
effective amount of an antibody, an oligopeptide or a small organic molecule that binds to the TAT polypeptide, 

10 thereby antagonizing the growth potentiating activity of said TAT polypeptide and resulting in the effective 
therapeutic treatment of the tumor. Optionally, the antibody is a monoclonal antibody, antibody fragment, 
chimeric antibody, humanized antibody, or single-chain antibody. Antibodies, TAT binding oligopeptides and 
TAT binding organic molecules employed in the methods of the present invention may optionally be conjugated 
to a growth inhibitory agent or cytotoxic agent such as a toxin, including, for example, a maytansinoid or 

15 calicheamicin, an antibiotic, a radioactive isotope, a nucleolytic enzyme, or the like. The antibodies and 

oligopeptides employed in the methods of the present invention may optionally be produced in CHO cells or 
bacterial cells. 

Yet further embodiments of the present invention will be evident to the skilled artisan upon a reading 
of the present specification. 

20 

BRIEF DESCRIPTION OP THE DRAWINGS 
In the list of figures for the present application, specific cDNA sequences which are upregulated in 
certain tumor tissues as compared to their normal tissue counterparts are individually identified with a 
designation beginning with the letters "DNA" followed by a specific numerical designation. A full or partial 
25 length protein sequence that is encoded by a cDNA sequence identified and shown herein is individually 
identified with a designation beginning with the letters "PRO" followed by a specific numerical designation. 
Figures showing encoded amino acid sequences immediately follow the figure showing the cDNA sequence 
encoding that specific amino acid sequence. If start and/or stop codons have been identified in a cDNA 
sequence shown in the attached figures, they are shown in bold and underlined font. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
I, Definitions 

Hie terms "TAT polypeptide" and "TAT" as used herein and when immediately followed by a 
numerical designation, refer to various polypeptides, wherein the complete designation (i.e. ,TAT/number) refers 
to specific polypeptide sequences as described herein. The terms "TAT/number polypeptide" and 
5 TAT/number" wherein the term "number" is provided as an actual numerical designation as used herein 

encompass native sequence polypeptides, polypeptide variants and fragments of native sequence polypeptides 
and polypeptide variants (which are further defined herein). The TAT polypeptides described herein may be 
isolated from a variety of sources, such as from human tissue types or from another source, or prepared by 
recombinant or synthetic methods. The term "TAT polypeptide" refers to each individual TAT/number 

10 polypeptide disclosed herein. All disclosures in this specification which refer to the "TAT polypeptide" refer 
to each of the polypeptides individually as well as jointly. For example, descriptions of the preparation of, 
purification of, derivation of, formation of antibodies to or against, formation of TAT binding oligopeptides to 
or against, formation of TAT binding organic molecules to or against, administration of, compositions 
containing, treatmentof a disease with, etc., pertain to each polypeptide of the invention individually. The term 

15 "TAT polypeptide" also includes variants of the TAT/number polypeptides disclosed herein. 

A "native sequence TAT polypeptide" comprises a polypeptide having the same amino acid sequence 
as the corresponding TAT polypeptide derived from nature. Such native sequence TAT polypeptides can be 
isolated from nature or can be produced by recombinant or synthetic means. The term "native sequence TAT 
polypeptide" specifically encompasses naturally-occurring truncated or secreted forms of the specific TAT 

20 polypeptide (e.g., an extracellular domain sequence), naturally-occurring variant forms (e.g., alternatively 
spliced forms) and natnratty-occurring allelic variants of the polypeptide. In certain embodiments of the 
invention, the native sequence TAT polypeptides disclosed herein are mature or full-length native sequence 
polypeptides comprising the full-length amino acids sequences shown in the accompanying figures. Start and 
stop codons (if indicated) are shown in bold font and underlined in the figures. Nucleic acid residues indicated 

25 as W N" in the accompanying figures are any nucleic acid residue. However, while the TAT polypeptides 

disclosed in the accompanying figures are shown to begin with methionine residues designated herein as amino 
acid position 1 in the figures, it is conceivable and possible that other methionine residues located either 
upstream or downstream from the amino acid position 1 in die figures may be employed as the starting amino 
acid residue for the TAT polypeptides. 

30 The TAT polypeptide "extracellular domain* or "ECD" refers to a form of the TAT polypeptide which 

is essentially free of the transmembrane and cytoplasmic domains. Ordinarily, a TAT polypeptide ECD will 
have less than 1% of such transmembrane and/or cytoplasmic domains and preferably, will have less than 0.5% 
of such domains. It will be understood that any transmembrane domains identified for the TAT polypeptides 
of the present invention are identified pursuant to criteria routinely employed in the art for identifying that type 

35 of hydrophobic domain. The exact boundaries of a transmembrane domain may vary but most likely by no more 
than about 5 amino acids at either end of the domain as initially identified herein. Optionally, therefore, an 
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extracellular domain of a TAT polypeptide may contain from about 5 or fewer amino acids on either side of the 
transmembrane domain/extracellular domain boundary as identified in the Examples or specification and such 
polypeptides, with or without the associated signal peptide, and nucleic acid encoding them, are contemplated 
by the present inventioru 

The approximate location of the "signal peptides" of the various TAT polypeptides disclosed herein 
5 may be shown in the present specification and/or the accompanying figures. It is noted, however, that the C- 
terminal boundary of a signal peptide may vary, but most likely by no more than about 5 amino acids on either 
side of the signal peptide C-terminal boundary as initially identified herein, wherein the C-terminal boundary 
of the signal peptide may be identified pursuant to criteria routinely employed in the art for identifying that type 
of amino acid sequence element (e.g., Nielsen et aL, Prot. Eng. 10:1-6 (1997) and von Heinje et al., Nucl. 

10 Acids. Res. 14:4683-4690 (1986)). Moreover, it is also recognized that, uv some cases, cleavage of a signal 
sequence from a secreted polypeptide is not entirely uniform, resulting in more than one secreted species . These 
mature polypeptides, where the signal peptide is cleaved within no more than about 5 amino acids on either side 
of the C-terminal boundary of the signal peptide as identified herein, and the polynucleotides encoding them, 
are contemplated by the present invention. 

15 "TAT polypeptide variant" means a TAT polypeptide, preferably an active TAT polypeptide, as defined 

herein having at least about 80% amino acid sequence identity with a full-length native sequence TAT 
polypeptide sequence as disclosed herein, a TAT polypeptide sequence lacking the signal peptide as disclosed 
herein, an extracellular domain of a TAT polypeptide, with or without the signal peptide, as disclosed herein 
or any other fragment of a full-length TAT polypeptide sequence as disclosed herein (such as those encoded by 

20 a nucleic acid that represents only a portion of the complete coding sequence for a full-length TAT polypeptide) . 
Such TAT polypeptide variants include, for instance, TAT polypeptides wherein one or more amino acid 
residues are added, or deleted, at the N- or C-terminus of the full-length native amino acid sequence. 
Ordinarily, a TAT polypeptide variant will have at least about 80% amino acid sequence identity, alternatively 
at least about 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 

25 97%, 98%, or 99% amino acid sequence identity, to a full-length native sequence TAT polypeptide sequence 
as disclosed herein, a TAT polypeptide sequence lacking the signal peptide as disclosed herein, an extracellular 
domain of a TAT polypeptide, with or without the signal peptide, as disclosed herein or any other specifically 
defined fragment of a full-length TAT polypeptide sequence as disclosed herein. Ordinarily, TAT variant 
polypeptides are at least about 10 amino acids in length, alternatively at least about 20, 30, 40, 50, 60, 70, 80, 

30 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 
310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 450, 460, 470, 480, 490, 500, 510, 
520, 530, 540, 550, 560, 570, 580, 590, 600 amino acids in length, or more. Optionally, TAT variant 
polypeptides will have no more than one conservative amino acid substitution as compared to the native TAT 
polypeptide sequence, alternatively no more than 2, 3, 4, 5, 6, 7, 8, 9, or 10 conservative amino acid 

35 substitution as compared to the native TAT polypeptide sequence. 

"Percent (%) amino acid sequence identity" with respect to the TAT polypeptide sequences identified 
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herein is defined as the percentage of amino acid residues in a candidate sequence that are identical with the 
amino acid residues in the specific TAT polypeptide sequence*, after aligning the sequences and introducing gaps, 
if necessary, to achieve the maximum percent sequence identity, and not considering any conservative 
substitutions as part of the sequence identity. Alignment for purposes of deterrnining percent amino acid 
sequence identity can be achieved in various ways.that are within the skill in the art, for instance, using publicly 
available computer software such as BLAST, BLAST-2, ALIGN or Megalign (DNASTAR) software. Those 
skilled in the art can determine appropriate parameters for measuring alignment, including any algorithms 
needed to achieve m a xim al alignment over the full length of the sequences being compared. For purposes 
herein, however, % amino acid sequence identity values are generated using the sequence comparison computer 
program ALIGN-2, wherein the complete source code for the ALIGN-2 program is provided in Table 1 below. 
The ALIGN-2 sequence comparison computer program was authored by Genentech, Inc. and the source code 
showninTable 1 below has been filed with user documentation in the U.S. Copyright Office, WashingtonD.C, 
20559, where it is registered under U.S. Copyright Registration No. TXU510087. The ALIGN-2 program is 
publicly available through Genentech, Inc. , South San Francisco, California or may be compiled from the source 
code provided in Table 1 below. The ALIGN-2 program should be compiled for use on a UNIX operating 
system, preferably digital UNIX V4.0D. All sequence comparison parameters are set by the ALIGN-2 
program and do not vary. 

In situations where ALIGN-2 is employed for amino acid sequence comparisons, the % amino acid 
sequence identity of a given amino acid sequence A to, with, or against a given amino acid sequence B (which 
can alternatively be phrased as a given amino acid sequence A that has or comprises a certain % amino acid 
sequence identity to, with, or against a given amino acid sequence B) is calculated as follows: 

100 times the fraction X/Y 

where X is the number of amino acid residues scored as identical matches by the sequence alignment program 
ALIGN-2 in that program's alignment of A and B, and where Y is the total number of amino acid residues in 
B. It will be appreciated that where the length of amino acid sequence A is not equal to the length of amino acid 
sequence B, the % amino acid sequence identity of A to B will not equal the % amino acid sequence identity 
of B to A. As examples of % amino acid sequence identity calculations using this method, Tables 2 and 3 
demonstrate how to calculate the % amino acid sequence identity of the amino acid sequence designated 
"Comparison Protein" to the amino acid sequence designated tt TAT n , wherein "TAT" represents the amino acid 
sequence of a hypothetical TAT polypeptide of interest, "Comparison Protein" represents the amino acid 
sequence of a polypeptide against which the "TAT W polypeptide of interest is being compared, and "X, "Y M and 
"Z " each represent different hypothetical amino acid residues. Unless specifically stated otherwise, all % amino 
acid sequence identity values used herein are obtained as described in the immediately preceding paragraph using 
the ALIGN-2 computer program. 

TAT variant polynucleotide" or "TAT variant nucleic acid sequence" means a nucleic acid molecule 
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which encodes a TAT polypeptide, preferably an active TAT polypeptide, as defined herein and which has at 
least about 80% nucleic acid sequence identity with a nucleotide acid sequence encoding a full-length native 
sequence TAT polypeptide sequence as disclosed herein, afull-length native sequence TAT polypeptide sequence 
lacking the signal peptide as disclosed herein, an extracellular domain of a TAT polypeptide, with or without 
the signal peptide, as disclosed herein or any other fragment of a full-length TAT polypeptide sequence as 
5 disclosed herein (such as those encoded by a nucleic acid that represents only a portion of the. complete coding 
sequence for a full-length TAT polypeptide). Ordinarily, a TAT variant polynucleotide will have at least about 
80% nucleic acid sequence identity, alternatively at least about 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 
89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% nucleic acid sequence identity with a 
nucleic acid sequence encoding a full-length native sequence TAT polypeptide sequence as disclosed herein, a 
10 full-length native sequence TAT polypeptide sequence lacking the signal peptide as disclosed herein, an 
extracellular domain of a TAT polypeptide, with or without the signal sequence, as disclosed herein or any other 
fragment of a full-length TAT polypeptide sequence as disclosed herein. Variants do not encompass the native 
nucleotide sequence. 

Ordinarily, TAT variant polynucleotides are at least about 5 nucleotides in length, alternatively at least 

15 about 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21. 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 
50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 105, 110, 115, 120, 125, 130, 135, 140, 145, 150, 155, 160, 165, 
170, 175, 180, 185, 190, 195, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 
350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 450, 460, 470, 480, 490, 500, 510, 520, 530, 540, 550, 
560, 570, 580, 590, 600, 610, 620, 630, 640, 650, 660, 670, 680, 690, 700, 710, 720, 730, 740, 750, 760, 

20 770, 780, 790, 800, 810, 820, 830, 840, 850, 860, 870, 880, 890, 900, 910, 920, 930, 940, 950, 960, 970, 
980, 990, or 1000 nucleotides in length, wherein in this context the term "about" means the referenced 
nucleotide sequence length plus or minus 10% of that referenced length. 

"Percent (%) nucleic acid sequence identity" with respect to TAT-encoding nucleic acid sequences 
identified herein is defined as die percentage of nucleotides in a candidate sequence that are identical with die 

25 nucleotides in the TAT nucleic acid sequence of interest, after aligning the sequences and introducing gaps, if 
necessary, to achieve the maximum percent sequence identity. Alignment for purposes of determining percent 
nucleic acid sequence identity can be achieved in various ways that are within the skill in the art, for instance, 
using publicly available computer software such as BLAST, BLAST-2, ALIGN or Megalign (DNASTAR) 
software. For purposes herein, however, % nucleic acid sequence identity values are generated using the 

30 sequence comparison computer program ALIGN-2, wherein the complete source code for the AUGN-2 
program is provided in Table 1 below. The ALIGN-2 sequence comparison computer program was authored 
by Genentech, Inc. and the source code shown in Table 1 below has been filed with user documentation in the 
U.S. Copyright Office, Washington D.C., 20559, where it is registered under U.S. Copyright Registration No. 
TXU5 10087. The ALIGN-2 program is publicly available through Genentech, Inc., South San Francisco, 

35 California or may be compiled from the source code provided in Table 1 below. The ALIGN-2 program should 
be compiled for use on a UNIX operating system, preferably digital UNIX V4.0D. All sequence comparison 
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parameters are set by tiie AUGN-2 program and do not vary. 

In situations where AUGN-2 is employed for nucleic acid sequence comparisons, the % nucleic acid 
sequence identity of a given nucleic acid sequence C to, with, or against a given nucleic acid sequence D (which 
can alternatively be phrased as a given nucleic acid sequence C that has or comprises a certain % nucleic acid 
s sequence identity to, with, or against a given nucleic acid sequence D) is calculated as follows: 

5 

100 times the fraction W/Z 

where W is the number of nucleotides scored as identical matches by the sequence alignment program AUGN-2 
in that program's alignment of C and D, and where Z is the total number of nucleotides in D. It will be 

1 0 appreciated that where the length of nucleic acid sequence C is not equal to the length of nucleic acid sequence 
D, the % nucleic acid sequence identity of C to D will not equal the % nucleic acid sequence identity of D to 
C. As examples of % nucleic acid sequence identity calculations, Tables 4 and 5, demonstrate how to calculate 
the % nucleic acid sequence identity of the nucleic acid sequence designated "Comparison DNA* to the nucleic 
acid sequence designated "TAT-DNA" , wherein "TAT-DNA" represents a hypothetical TAT-encoding nucleic 

15 acid sequence of interest, "Comparison DNA" represents the nucleotide sequence of a nucleic acid molecule 
against which the "TAT-DNA" nucleic acid molecule of interest is being compared, and "N\ and " V" each 
represent different hypothetical nucleotides. Unless specifically stated otherwise, all % nucleic acid sequence 
identity values used herein are obtained as described in the immediately preceding paragraph using the AUGN-2 
computer program. 

20 In other embodiments, TAT variant polynucleotides are nucleic acid molecules that encode a TAT 

polypeptide and which are capable of hybridizing, preferably under stringent hybridization and wash conditions, 
to nucleotide sequences encoding a full-length TAT polypeptide as disclosed herein. TAT variant polypeptides 
may be those that are encoded by a TAT variant polynucleotide. 

The term "full-length coding region" when used in reference to a nucleic acid encoding a TAT 

25 polypeptide refers to the sequence of nucleotides which encode die full-length TAT polypeptide of the invention 
(which is often shown between start and stop codons, inclusive thereof, in the accompanying figures). The term 
"full-length coding region" when used in reference to an ATCC deposited nucleic acid refers to the TAT 
polypeptide-encoding portion of the cDNA that is inserted into the vector deposited with die ATCC (which is 
often shown between start and stop codons, inclusive thereof, in the accompanying figures). 

30 "Isolated," when used to describe the various TAT polypeptides disclosed herein, means polypeptide 

that has been identified and separated and/or recovered from a component of its natural environment 
Contaminant components of its natural environment are materials that would typically interfere with diagnostic 
or therapeutic uses for the polypeptide, and may include enzymes, hormones, and other proteinaceous or non- 
proteinaceous solutes. In preferred embodiments, the polypeptide will be purified (1) to a degree sufficient to 

35 obtain at least 15 residues of N-terminal or internal amino acid sequence by use of a spinning cup sequenator, 
or (2) to homogeneity by SDS-PAGE under non-reducing or reducing conditions using Coomassie blue or, 
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preferably, silver stain. Isolated polypeptide includes polypeptides situ within recombinant cells, since at least 
one component of the TAT polypeptide natural environment will not be present Ordinarily, however, isolated 
polypeptide will be prepared by at least one purification step. 

An "isolated" TAT polypeptide^ncoding nucleic acid or other polypeptide-encoding nucleic acid is a 
nucleic acid molecule that is identified and separated from at least one contaminant nucleic acid molecule with 
which it is ordinarily associated in the natural source of the polypeptide-encoding nucleic acid. An isolated 
poiypeptide-encoding nucleic acid molecule is other than in the form or setting in which it is found in nature. 
Isolated polypeptide-encoding nucleic acid molecules therefore are distinguished from the specific polypeptide- 
encoding nucleic acid molecule as it exists in natural cells. However, an isolated polypeptide-encoding nucleic 
acid molecule includes polypeptide-encoding nucleic acid molecules contained in cells that ordinarily express 
the polypeptide where, for example, the nucleic acid molecule is in a chromosomal location different from that 
of natural cells. 

The term "control sequences" refers to DNA sequences necessary for the expression of an operably 
linked coding sequence in a particular host organism. The control sequences that are suitable for prokaryotes, 
for example, include a promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic cells 
are known to utilize promoters, polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic 
acid sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA for a 
polypeptide if it is expressed as a preprotein that participates in the secretion of the polypeptide; a promoter or 
enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; or a ribosome 
binding site is operably linked to a coding sequence if it is positioned so as to facilitate translation. Generally, 
"operably linked" means that the DNA sequences being linked are contiguous, and, in the case of a secretory 
leader, contiguous and in reading phase. However, enhancers do not have to be contiguous. Linking is 
accomplished by ligation at convenient restriction sites. If such sites do not exist, the synthetic oligonucleotide 
adaptors or linkers are used in accordance with conventional practice. 

"Stringency" of hybridization reactions is readily determinable by one of ordinary skill in the art, and 
generally is an empirical calculation dependent upon probe length, washing temperature, and salt concentration. 
In general, longer probes require higher temperatures for proper annealing, while shorter probes need lower 
temperatures. Hybridization generally depends on the ability of denatured DNA to reanneal when 
complementary strands are present in an environment below their melting temperature. The higher the degree 
of desired homology between the probe and hybridizable sequence, the higher the relative temperature which 
can be used. As a result, it follows that higher relative temperatures would tend to make the reaction conditions 
more stringent, while lower temperatures less so. For additional details and explanation of stringency of 
hybridization reactions, see Ausubel et al., Current Protoc ols in Molecular Biology, Wiley Interscience 
Publishers, (1995). 

"Stringent conditions" or "high stringency conditions", as defined herein, may be identified by those 
that: (1) employ low ionic strength and high temperature for washing, for example 0.015 M sodium 
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chloride/0.0015 M sodium citrate/0.1% sodium dodecyl sulfate at 50°C; (2) employ during hybridization a 
denaturing agent, such as fonnamide, for example, 50% (v/v) formamide with 0.1% bovine serum 
. albumin/0.1% FicoH/0.1% polyvinylpyrrolidone/50mM sodium phosphate buffer at pH 6.5 with 750 mM 
sodium chloride, 75 mM sodium citrate at 42°C; or (3) overnight hybridization in a solution that employs 50% 
fonnamide, 5 x SSC (0.75 M NaQ, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0. 1 % sodium 
5 pyrophosphate, 5 x Denhardt's solution, sonicated salmon sperm DNA (50 jig/ml), 0. 1% SDS, and 10% dextran 
sulfate at 42°C, with a 10 minute wash at 42°C in 0.2 x SSC (sodium chloride/sodium citrate) followed by a 
10 minute high-stringency wash consisting of 0.1 x SSC containing EDTA at 55 °C. 

"Moderately stringent conditions* may be identified as described by Sambrook et al., Molecular 
Cloning: A Laboratory Manual. New York: Cold Spring Harbor Press, 1989, and include the use of washing 

10 solution and hybridization conditions (e.g., temperature, ionic strength and %SDS) less stringent that those 
described above. An example of moderately stringent conditions is overnight incubation at 37°Cin a solution 
comprising: 20% formamide, 5 x SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate 
(pH7.6), 5 x Denhardt's solution, 10% dextran sulfate, and 20 mg/ml denatured sheared salmon sperm DNA, 
followed by washing the filters in 1 x SSC at about 37-50°C. The skilled artisan will recognize how to adjust 

15 the temperature, ionic strength, etc. as necessary to accommodate factors such as probe length and the like. 

The term "epitope tagged" when used herein refers to a chimeric polypeptide comprising a TAT 
polypeptide or anti-TAT antibody fused to a "tag polypeptide". The tag polypeptide has enough residues to 
provide an epitope against which an antibody can be made, yet is short enough such that it does not interfere 
with activity of the polypeptide to which it is fused. The tag polypeptide preferably also is fairly unique so that 

20 the antibody does not substantially cross-react with other epitopes. Suitable tag polypeptides generally have at 
least six amino acid residues and usually between about 8 and 50 amino acid residues (preferably, between about 
10 and 20 amino acid residues). 

"Active" or "activity" for the purposes herein refers to form(s) of a TAT polypeptide which retain a 
biological and/or an immunological activity of native or naturally-occurring TAT, wherein "biological" activity 

25 refers to a biological function (either inhibitory or stimulatory) caused by a native or naturally-occurring TAT 
other than the ability to induce the production of an antibody against an antigenic epitope possessed by a native 
or naturally-occurring TAT and an "immunological" activity refers to the ability to induce the production of 
an antibody against an antigenic epitope possessed by a native or naturally-occurring TAT. 

The term "antagonist" is used in the broadest sense, and includes any molecule that partially or fully 

30 blocks, inhibits, or neutralizes a biological activity of a native TAT polypeptide disclosed herein. In a similar 
manner, the term "agonist" is used in the broadest sense and includes any molecule that mimics a biological 
activity of a native TAT polypeptide disclosed herein. Suitable agonist or antagonist molecules specifically 
include agonist or antagonist antibodies or antibody fragments, fragments or amino acid sequence variants of 
native TAT polypeptides, peptides, antisense oligonucleotides, small organic molecules, etc. Methods for 

35 identifying agonists or antagonists of a TAT polypeptide may comprise contacting a TAT polypeptide with a 
candidate agonist or antagonist molecule and measuring a detectable change in one or more biological activities 
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normally associated with the TAT polypeptide. 

"Treating" or "treatment" or "alleviation" refers to both therapeutic treatment and prophylactic or 
preventative measures, wherein the object is to prevent or slow down (lessen) the targeted pathologic condition 
or disorder. Those in need of treatment include those already with the disorder as well as those prone to have 
the disorder or those in whom the disorder is to be prevented. A subject or mammal is successfully "treated" 
5 for a TAT polypeptide-expressing cancer if, after receiving a therapeutic amount of an anti-TAT antibody, TAT 
binding oligopeptide or TAT binding organic molecule according to the methods of the present invention, the 
patient shows observable and/or measurable reduction in or absence of one or more of the following: reduction 
in the number of cancer cells or absence of the cancer cells; reduction in the tumor size; inhibition (i.e., slow 
to some extent and preferably stop) of cancer cell infiltration into peripheral organs including the spread of 

10 cancer into soft tissue and bone; inhibition (i.e., slow to some extent andpreferably stop) of tumor metastasis; 
inhibition, to some extent, of tumor growth; and/or relief to some extent, one or more of the symptoms 
associated with the specific cancer; reduced morbidity and mortality, and improvement in quality of life issues. 
To die extent the anti-TAT antibody or TAT binding oligopeptide may prevent growth and/or kill existing cancer 
cells, it may be cytostatic and/or cytotoxic. Reduction of these signs or symptoms may also be felt by the 

15 patient 

The above parameters for assessing successful treatment and improvement in the disease are readily 
measurable by routine procedures familiar to a physician. For cancer therapy, efficacy can be measured, for 
example, by assessing the time to disease progression (TTP) and/or determining the response rate (RR). 
Metastasis can be determined by staging tests and by bone scan and tests for calcium level and other enzymes 

20 to determine spread to the bone. CT scans can also be done to look for spread to the pelvis and lymph nodes 
in the area. Chest X-rays and measurement of liver enzyme levels by known methods are used to look for 
metastasis to the lungs and liver, respectively. Other routine methods for monitoring the disease include 
transrectal ultrasonography (TRUS) and transrectal needle biopsy (TRNB). 

For bladder cancer, which is a more localized cancer, methods to determine progress of disease include 

25 urinary cytologic evaluation by cystoscopy, monitoring for presence of blood in the urine, visualization of the 
urothelial tract by sonography or an intravenous pyelogram, computed tomography (CT) and magnetic resonance 
imaging (MRI). The presence of distant metastases can be assessed by CT of die abdomen, chest x-rays, or 
radionuclide imaging of the skeleton. 

"Chronic" administration refers to administration of the agent(s) in a continuous mode as opposed to 

30 an acute mode, so as to maintain the initial therapeutic effect (activity) for an extended period of time. 

"Intermittent* administration is treatment that is not consecutively done without interruption, but rather is cyclic 
in nature. 

"Mammal" for purposes of the treatment of, alleviating the symptoms of or diagnosis of a cancer refers 
to any animal classified as a mammal, including humans, domestic and farm animals, and zoo, sports, or pet 
35 animals, such as dogs, cats, cattle, horses, sheep, pigs, goats, rabbits, etc. Preferably, the ipammal is human. 

Administration "in combination with" one or more further therapeutic agents includes simultaneous 
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(concurrent) and consecutive administration in any order. 

"Carriers " as used herein include pharmaceutical acceptable carriers, excipients, or stabilizers which 
are nontoxic to the cell or mammal being exposed thereto at the dosages and concentrations employed. Often 
the physiologically acceptable carrier is an aqueous pH buffered solution. Examples of physiologically 
acceptable carriers include buffers such as phosphate, citrate, and other organic acids; antioxidants including 
ascorbic acid; low molecular weight (less than about 10 residues) polypeptide; proteins, such as serum albumin, 
gelatin, or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as glycine, 
glutamine, asparagine, arginine or lysine; monosaccharides, disaccharides, and other carbohydrates including 
glucose, mannose, or dextrins; chelating agents such as EDTA; sugar alcohols such as mannitol or sorbitol; salt- 
forming counterions such as sodium; and/or nonionic surfactants such as T WEEN® , polyethylene glycol (PEG) , 
and PLURONICS®. 

By "solid phase" or "solid support" is meant a non-aqueous matrix to which an antibody, TAT binding 
oligopeptide or TAT binding organic molecule of the present invention can adhere or attach. Examples of solid 
phases encompassed herein include those formed partially or entirely of glass (e.g., controlled pore glass), 
polysaccharides (e.g., agarose), poly acrylamides , polystyrene, polyvinyl alcohol and silicones. In certain 
embodiments, depending on the context, the solid phase can comprise the well of an assay plate; in others it is 
a purification column (e.g. , an affinity chromatography column). This term also includes a discontinuous solid 
phase of discrete particles, such as those described in U.S. Patent No. 4,275,149. 

A "liposome" is a small vesicle composed of various types of lipids, phospholipids and/or surfactant 
which is useful for delivery of a drug (such as a TAT polypeptide, an antibody thereto or a TAT binding 
oligopeptide) to a mammal. The components of die liposome are commonly arranged in a bilayer formation, 
similar to the lipid arrangement of biological membranes. 

A "small" molecule or "small" organic molecule is defined herein to have a molecular weight below 
about 500 Daltons. 

An "effective amount" of a polypeptide, antibody, TAT binding oligopeptide, TAT binding organic 
molecule or an agonist or antagonist thereof as disclosed herein is an amount sufficient to carry out a specifically 
stated purpose. An "effective amount" may be determined empirically and in a routine manner, in relation to 
the stated purpose. 

The term "therapeutically effective amount" refers to an amount of an antibody, polypeptide, TAT 
binding oligopeptide, TAT binding organic molecule or other drug effective to "treat" a disease or disorder in 
a subject or mammal. In the case of cancer, the therapeutically effective amount of the drug may reduce the 
number of cancer cells; reduce the tumor size; inhibit (i.e., slow to some extent and preferably stop) cancer cell 
infiltration into peripheral organs; inhibit (i.e., slow to some extent and preferably stop) tumor metastasis; 
inhibit, to some extent, tumor growth; and/or relieve to some extent one or more of die symptoms associated 
with the cancer. See the definition herein of "treating" . To the extent the drug may prevent growth and/or kill 
existing cancer cells, it may be cytostatic and/or cytotoxic. 

A "growth inhibitory amount" of an anti-TAT antibody, TAT polypeptide, TAT binding oligopeptide 
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or TAT binding organic molecule is an amount capable of inhibiting the growth, of a cell, especially tumor, e.g., 
cancer cell, either in vitro or in vivo. A "growth inhibitory amount" of an anti-TAT antibody, TAT 
polypeptide, TAT binding oligopeptide or TAT binding organic molecule for purposes of inhibiting neoplastic 
cell growth may be determined empirically and in a routine manner. 

A "cytotoxic amount" of an anti-TAT antibody, TAT polypeptide, TAT binding oligopeptide or TAT 
5 binding organic molecule is an amount capable of causing the destruction of a cell, especially tumor, e.g., 
cancer cell, either in vitro or in vivo, A "cytotoxic amount" of an anti-TAT antibody, TAT polypeptide, TAT 
binding oligopeptide or TAT binding organic molecule for purposes of inhibiting neoplastic cell growth may be 
determined empirically and in a routine manner. 

The term "antibody" is used in the broadest sense and specifically covers, for example, single anti-TAT 

10 monoclonal antibodies (including agonist, antagonist, and neutralizing antibodies), anti-TAT antibody 

compositions with polyepitopic specificity, polyclonal antibodies, single chain anti-TAT antibodies, and 
fragments of anti-TAT antibodies (see below) as long as they exhibit the desired biological or immunological 
activity. The term "immunoglobulin" (Ig) is used interchangeable with antibody herein. 

An "isolated antibody" is one which has been identified and separated and/or recovered from a 

1 5 component of its natural environment. Contaminant components of its natural environment are materials which 
would interfere with diagnostic or therapeutic uses for die antibody, and may include enzymes, hormones, and 
other proteinaceous or nonproteinaceous solutes. In preferred embodiments, the antibody will be purified (1) 
to greater than 95% by weight of antibody as determined by the Lowry method, and most preferably more than 
99% by weight, (2) to a degree sufficient to obtain at least 15 residues of N-terminal or internal amino acid 

20 sequence by use of a spinning cup sequenator, or (3) to homogeneity by SDS-PAGE under reducing or 
nonreducing conditions using Coomassie blue or, preferably, silver stain. Isolated antibody includes the 
antibody in situ within recombinant cells since at least one component of the antibody's natural environment will 
not be present. Ordinarily, however, isolated antibody will be prepared by at least one purification step. 

The basic 4-chain antibody unit is a heterotetrameric glycoprotein composed of two identical ligjit (L) 

25 chains and two identical heavy (H) chains (an IgM antibody consists of 5 of the basic heterotetramer unit along 
with an additional polypeptide called J chain, and therefore contain 10 antigen binding sites, while secreted IgA 
antibodies can polymerize to form polyvalent assemblages comprising 2-5 of the basic 4-chain units along with 
J chain). In die case of IgGs, the 4-chain unit is generally about 150,000 daltons. Each L chain is linked to 
a H chain by one covalent disulfide bond, while the two H chains are linked to each other by one or more 

30 disulfide bonds depending on the H chain isotype. Each H and L chain also has regularly spaced intrachain 
disulfide bridges. Each H chain has at the N-terminus, a variable domain (V h) followed by three constant 
domains (C H ) for each of the a and y chains and four C H domains for u. and € isotypes. Each L chain has at the 
N-terminus, a variable domain (VJ followed by a constant domain (CJ at its other end. The V L is aligned with 
the V H and the Cl is aligned wife the first constant domain of the heavy chain (C H l). Particular amino acid 

35 residues are believed to form an interface between the light chain and heavy chain variable domains. The 
pairing of a V H and V L together forms a single antigen-binding site. For the structure and properties of the 
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different classes of antibodies, see, e.g., Basic and Cli™™! Immunology, 8th edition, Daniel P. Stites, Abba 
I. Terr and Tristram G. Parslow (eds.), Appleton & Lange, Norwalk, CT, 1994, page 71 and Chapter 6. 

The L chain from any vertebrate species can be assigned to one of two clearly distinct types, called 
kappa and lambda, based on the amino acid sequences of their constant domains. Depending on the amino acid 
sequence of the constant domain of iheir heavy chains (C H ), immunoglobulins can be assigned to different 
classes or isotypes. There are five classes of immunoglobulins: IgA, IgD, IgE, IgG, and IgM, having heavy 
chains designated a, 5, e, y, and p, respectively. The y and a classes are further divided into subclasses on the 
basis of relatively minor differences in C H sequence and function, e.g. , humans express the following subclasses: 
IgGl, IgG2, IgG3, IgG4, IgAl, and IgA2. 

The term "variable" refers to the fact that certain segments of the variable domains differ extensively 
in sequence among antibodies. The V domain mediates antigen binding and define specificity of a particular 
antibody for its particular antigen. However, the variability is not evenly distributed across the 1 10-amino acid 
span of the variable domains. Instead, the V regions consist of relatively invariant stretches called framework 
regions (FRs) of 15-30 amino acids separated by shorter regions of extreme variability called "hypervariable 
regions" that are each 9-12 amino acids long. The variable domains of native heavy and light chains each 
comprise four FRs, largely adopting a p-sheet configuration, connected by three hypervariable regions, which 
form loops connecting, and in some cases forming part of, the p-sheet structure. The hypervariable regions in 
each chain are held together in close proximity by the FRs and, with the hypervariable regions from the other 
chain, contribute to the formation of the antigen-binding site of antibodies (see Kabat et al., Sequences of 
PmteiTi* nf TmTnimological Interest. 5th Ed. Public Health Service, National Institutes of Health, Bethesda, MD. 
(1991)). The constant domains arc not involved directly in binding an antibody to an antigen, but exhibit various 
effector functions, such as participation of die antibody in antibody dependent cellular cytotoxicity (ADCC). 

The term "hypervariable region" when used herein refers to the amino acid residues of an antibody 
which are responsible for antigen-binding. The hypervariable region generally comprises amino acid residues 
from a "complementarity determining region" or "CDR" (e.g. around about residues 24-34 (LI), 50-56 (L2) 
and 89-97 (L3) in the V L , and around about 1-35 (HI), 50-65 (H2) and 95-102 (H3)inthe V H ; Kabat et al., 
Sequences of Proteins of Im munological interest. 5th Ed. Public Health Service, National Institutes of Health, 
Bethesda, MD. (199 1)) and/or those residues from a "hypervariable loop" (e\ g. residues 26-32 (LI), 50-52 (L2) 
and 91-96 (L3)inthe V L , and 26-32 (HI), 53-55 (H2) and 96-101 (H3) in the V„; Chothia and Lesk J. Mol. 
Biol. 196:901-917(1987)). 

The term "monoclonal antibody" as used herein refers to an antibody obtained from a population of 
substantially homogeneous antibodies, i.e., the individual antibodies comprising the population are identical 
except for possible naturally occurring mutations that may be present in minor amounts. Monoclonal antibodies 
are highly specific, being directed against a single antigenic site. Furthermore, in contrast to polyclonal 
antibody preparations which include different antibodies directed against different determinants (epitopes), each 
monoclonal antibody is directed against a single determinant on the antigen. In addition to their specificity, the 
monoclonal antibodies are advantageous in that they may be synthesized nncontaminated by other antibodies. 
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The modifier "monoclonal" is not to be construed as requiring production of the antibody by any particular 
method. For example, the monoclonal antibodies useful in the present invention may be prepared by the 
hybridoma methodology first described by Kohler et al., Nature. 256:495 (1975), or may be made using 
recombinant DNA methods in bacterial, eukaryotic animal orplant cells (see, e.g. , U.S. Patent No. 4,816,567). 
The "monoclonal antibodies" may also be isolated from phage antibody libraries using the techniques described 
5 in Clackson et al., Nature, 352:624-628 (1991) and Marks et al., J. Mol. Biol. , 222:581-597 (1991), for 
example. 

The monoclonal antibodies herein include "chimeric" antibodies in which a portion of the heavy and/or 
light chain is identical with or homologous to corresponding sequences in antibodies derived from a particular 
species or belonging to a particular antibody class or subclass, while the remainder of the chain(s) is identical 

10 with or homologous to corresponding sequences in antibodies derived from another species or belonging to 
another antibody class or subclass, as well as fragments of such antibodies, so long as they exhibit the desired 
biological activity (see U.S. Patent No. 4,816,567; and Morrison et al., Proc. Natl. Acad. Sci. USA. 81:6851- 
6855 (1984)). Chimeric antibodies of interest herein include "primatized* antibodies comprising variable domain 
antigen-binding sequences derived from a non-human primate (e.g. Old World Monkey, Ape etc), and human 

15 constant region sequences. 

An "intact" antibody is one which comprises an antigen-binding site as well as a Q and at least heavy 
chain constant domains, C H 1, C H 2 and C H 3. The constant domains may be native sequence constant domains 
(e.g. human native sequence constant domains) or amino acid sequence variant thereof. Preferably, the intact 
antibody has one or more effector functions. 

20 "Antibody fragments" comprise a portion of an intact antibody, preferably the antigen binding or 

variable region of the intact antibody. Examples of antibody fragments include Fab, Fab', F(ab') 2 , and Fv 
fragments; diabodies; linear antibodies (see U.S. Patent No. 5,641,870, Example 2; Zapata et al., Protein Eng. 
8(10): 1057-1062 [1995]); single-chain antibody molecules; and multispecific antibodies formed from antibody 
fragments. * 

25 Papain digestion of antibodies produces two identical antigen-binding fragments, called "Fab" 

fragments, and a residual "Fc" fragment, a designation reflecting the ability to crystallize readily. The Fab 
fragment consists of an entire L chain along with the variable region domain of the H chain (V H ), and the first 
constant domain of one heavy chain (C H 1). Each Fab fragment is monovalent with respect to antigen binding, 
i.e., it has a single antigen-binding site. Pepsin treatment of an antibody yields a single large F(ab^ fragment 

30 which roughly corresponds to two disulfide linked Fab fragments having divalent antigen-binding activity and 
is still capable of cross-linking antigen. Fab' fragments differ from Fab fragments by having additional few 
residues at the carboxy terminus of the C H 1 domain including one or more cysteines from the antibody hinge 
region. Fab'-SH is the designation herein for Fab' in which the cysteine residue(s) of the constant domains bear 
a free thiol group. F(ab^ antibody fragments originally were produced as pairs of Fab ' fragments which have 

35 hinge cysteines between them. Other chemical couplings of antibody fragments are also known. 

The Fc fragment comprises the carboxy-terminal portions of both H chains held together by disulfides. 
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The effector functions of antibodies are determined by sequences in the Fc region, which region is also the part 
recognized by Fc receptors (FcR) found on certain types of cells. 

"Fv" is the minimum antibody fragment which contains a complete antigen-recognition and -binding 
site. This fragment consists of a dimer of one heavy- and one light-chain variable region domain in tight, non- 
covalent association. From the folding of these two domains emanate six hypervariable loops (3 loops each from 
the H and L chain) that contribute the amino acid residues for antigen binding and confer antigen binding 
specificity to the antibody. However, even a single variable domain (or half of an Fv comprising only three 
CDRs specific for an antigen) has the ability to recognize and bind antigen, although at a lower affinity than the 
entire binding site. 

"Single-chain Fv" also abbreviated as "sFv n or "scFv" are antibody fragments that comprise the V H 
and V L antibody domains connected into a, single polypeptide chain. Preferably, the sFv polypeptide further 
comprises a polypeptide linker between the V H and V L domains which enables the sFv to form the desired 
structure for antigen binding. For a review of sFv, see Piuckthun in The Pharmacology of Monoclonal, 
Antibodies, vol. 113, Rosenhurg and Moore eds., Springer-Verlag, New York, pp. 269-315 (1994); Borrebaeck 
1995, infra. 

The term "diabodies" refers to small antibody fragments prepared by constructing sFv fragments (see 
preceding paragraph) with short linkers (about 5-10 residues) between the V H and V L domains such that inter- 
chain but not intra-chain pairing of the V domains is achieved, resulting in a bivalent fragment, i.e., fragment 
having two antigen-binding sites. Bispecific diabodies are heterodimers of two "crossover" sFv fragments in 
which the V tt and V L domains of the two antibodies are present on different polypeptide chains. Diabodies are 
described more fully in, for example, EP 404,097; WO 93/11161; andHollingeretal., Proc. Natl. Acad. ScL 
USA, 90:6444-6448 (1993). 

"Humanized" forms of non-human (e.g., rodent) antibodies are chimeric antibodies that contain 
minimal sequence derived from the non-human antibody. For the most part, humanized antibodies are human 
immunoglobulins (recipient antibody) in which residues from a hypervariable region of the recipient are replaced 
by residues from a hypervariable region of a non-human species (donor antibody) such as mouse, rat, rabbit or 
non-human primate having the desired antibody specificity, affinity, and capability. In some instances, 
framework region (FR) residues of the human immunoglobulin are replaced by corresponding non-human 
residues. Furthermore, humanized antibodies may comprise residues that are not found in the recipient antibody 
or in the donor antibody. These modifications are made to further refine antibody performance. In general, 
the humanized antibody will comprise substantially all of at least one, and typically two, variable domains, in 
which all or substantially all of the hypervariable loops correspond to those of a non-human immunoglobulin 
and all or substantially all of the FRs are those of a human immunoglobulin sequence. The humanized antibody 
optionally also will comprise at least a portion of an immunoglobulin constant region (Fc), typically that of a 
human immunoglobulin. For further details, see Jones et al., Nature 321:522-525 (1986); Riechmann et al., 
Nature 332:323-329 (1988); and Presta, Curr. Oo. Struct. Biol. 2:593-596 (1992). 

A "species-dependent antibody," e.g., a mammalian anti-human IgE antibody, is an antibody which 
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has a stronger binding affinity for an antigen from a first mammalian species than it has for a homologue of that 
antigen from a second mammalian species. Normally, the species-dependent antibody "bind specifically" to 
a human antigen (i.e., has a binding affinity (Kd) value of no more than about 1 x 10~ 7 M, preferably no more 
than about 1 x 10* and most preferably no more than about 1 x 10" 9 M) but has a binding affinity for a 
homologue of the antigen from a second non-human mammalian species which is at least about 50 fold, or at 
least about 500 fold, or at least about 1000 fold, weaker than its binding affinity for die human antigen. The 
species-dependent antibody can be of any of the various types of antibodies as defined above, but preferably is 
a humanized or human antibody. 

A "TAT binding oligopeptide" is an oligopeptide that binds, preferably specifically, to a TAT 
polypeptide as described herein. TAT binding oligopeptides may be chemically synthesized using known 
oligopeptide synthesis methodology or may be prepared and purified using recombinant technology. TAT 
binding oligopeptides are usually at least about 5 amino acids in length, alternatively at least about 6, 7, 8, 9, 
10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 
38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 
66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 
94, 95, 96, 97, 98, 99, or 100 amino acids in length or more, wherein such oligopeptides that are capable of 
binding, preferably specifically, to a TAT polypeptide as described herein. TAT binding oligopeptides may be 
identified without undue experimentation using well known techniques . In this regard, it is noted that techniques 
for screening oligopeptide libraries for oligopeptides that are capable of specifically binding to a polypeptide 
target are well known in the art (see, e.g., U.S. Patent Nos. 5,556,762, 5,750,373, 4,708,871, 4,833,092, 
5,223,409, 5,403,484, 5,571,689, 5,663,143; PCT Publication Nos. WO 84/03506 and WO84/03564; Geysen 
et al., Proc. Natl. Acad. Sci. U.S.A., 81:3998-4002 (1984); Geysen et al., Proc. Natl. Acad. ScL U.S.A., 
82:178-182 (1985); Geysen et al., in Synthetic Peptides as Antigens, 130-149 (1986); Geysen et al., J. 
Immunol. Meth., 102:259-274 (1987); Schoofc et al., J. Immunol., 140:611-616 (1988), Cwirla, S. E. et al. 
(1990) Proc. Nad. Acad. Sci. USA, 87:6378; Lowman, H.B. et al. (1991) Biochemistry, 30:10832; Clackson, 
T. et al. (1991) Nature, 352: 624; Marks, J. D. etal. (1991), J. Mol. Biol., 222:581; Kang, A.S. et al. (1991) 
Proc. Natl. Acad. Sci. USA, 88:8363, and Smith, G. P. (1991) Current Opin. Biotechnol., 2:668). 
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A "TAT binding organic molecule" is an organic molecule other than an oligopeptide or antibody as 
defined herein that binds, preferably specifically, to a TAT polypeptide as described herein. TAT binding 
organic molecules may be identified and chemically synthesized using known methodology (see, e.g., PCT 
Publication Nos. WO00/00823 and WO00/39585). TAT binding organic molecules are usually less than about 
2000 daltons in size, alternatively less than about 1500, 750, 500, 250 or 200 daltons in size, wherein such 
5 organic molecules that are capable of binding, preferably specifically, to a TAT polypeptide as described herein 
may be identified without undue experimentation using well known techniques. In this regard, it is noted that 
techniques for screening organic molecule libraries for molecules that are capable of binding to a polypeptide 
target are well known in the art (see, e.g., PCT Publication Nos. WOOO/00823 and WO00/39585). 

An antibody, oligopeptide or other organic molecule "whichbinds w an antigen of interest, e.g. atumor- 

1 0 associated polypeptide antigen target, is one that binds the antigen with sufficient affinity such that the antibody, 
oligopeptide or other organic molecule is useful as a diagnostic and/or therapeutic agent in targeting a cell or 
tissue expressing the antigen, and does not significantly cross-react with other proteins. In such embodiments, 
the extent of binding of the antibody, oligopeptide or other organic molecule to a "non-target* protein will be 
less than about 10% of the binding of the antibody, oligopeptide or other organic molecule to its particular target 

15 protein as determined by fluorescence activated cell sorting (FACS) analysis or radloimmunoprecipitation (RIA). 
With regard to the binding of an antibody, oligopeptide or other organic molecule to a target molecule, the term 
"specific binding" or "specifically binds to" or is "specific for" a particular polypeptide or an epitope on a 
particular polypeptide target means binding that is measurably different from a non-specific interaction. Specific 
binding can be measured, for example, by detenriining binding of a molecule compared to binding of a control 

20 molecule, which generally is a molecule of similar structure that does not have binding activity. For example, 
specific binding can be determined by competition with a control molecule that is similar to the target, for 
example, an excess of non-labeled target In this case, specific binding is indicated if the binding of the labeled 
target to a probe is competitively inhibited by excess unlabeled target. The term "specific binding" or 
"specifically binds to" or is "specific for" a particular polypeptide or an epitope on a particular polypeptide 

25 target as used herein can be exhibited, for example, by a molecule having a Kd for the target of at least about 
10 4 M, alternatively at least about 10" 5 M, alternatively at least about 10 6 M, alternatively at least about 10* 7 
M, alternatively at least about KT 8 M, alternatively at least about 1(T 9 M, alternatively at least about 10 10 M, 
alternatively at least about 10 u M, alternatively at least about 10 42 M, or greater. In one embodiment, the term 
"specific binding" refers to binding where a molecule binds to a particular polypeptide or epitope on a particular 

30 polypeptide without substantially binding to any other polypeptide or polypeptide epitope. 

An antibody, oligopeptide or other organic molecule that "inhibits the growth of tumor cells expressing 
a TAT polypeptide" or a "growth inhibitory" antibody, oligopeptide or other organic molecule is one which 
results in measurable growth inhibition of cancer cells expressing or overexpressing the appropriate TAT 
polypeptide. The TAT polypeptide may be a transmembrane polypeptide expressed on the surface of a cancer 

35 cell or may be a polypeptide that is produced and secreted by a cancer cell. Preferred growth inhibitory anti- 
TAT antibodies, oligopeptides or organic molecules inhibit growth of TAT-expressing tumor cells by greater 
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than 20%, preferably from about 20% to about 50%, and even more preferably, by greater than 50% (e.g., 
from about 50% to about 100%) as compared to the appropriate control, the control typically being tumor cells 
not treated with the antibody, oligopeptide or other organic molecule being tested. In one embodiment, growth 
inhibition can be measured at an antibody concentration of about 0.1 to 30 ng/ml or about 0.5 nM to 200 nM 
in cell culture, where the growth inhibition is determined 1-10 days after exposure of the rumor cells to the 
5 antibody. Growth inhibition of tumor cells in vivo can be determined in various ways such as is described in 
the Experimental Examples section below. The antibody is growth inhibitory in vivo if administration of the 
anti-TAT antibody at about 1 fig/kg to about 100 mg/kg body weight results in reduction in tumor size or tumor 
cell proliferation within about 5 days to 3 months from the first administration of the antibody, preferably within 
about 5 to 30 days. 

10 An antibody, oligopeptide or other organic molecule which "induces apoptosis" is one which induces 

programmed cell death as determined by binding of annexin V, fragmentation of DNA, cell shrinkage, dilation 
of endoplasmic reticulum, cell fragmentation, and/or formation of membrane vesicles (called apoptotic bodies). 
The cell is usually one which overexpresses a TAT polypeptide. Preferably the cell is a tumor cell, e.g., a 
prostate, breast, ovarian, stomach, endometrial, lung, kidney, colon, bladder cell. Various methods are 

15 available for evaluating the cellular events associated with apoptosis. For example, phosphatidyl serine (PS) 
translocation can be measured by annexin binding; DNA fragmentation can be evaluated through DNA 
laddering; and nuciear/chromatin condensation along with DNA fragmentation can be evaluated by any increase 
in hypodiploid cells. Preferably, the antibody, oligopeptide or other organic molecule which induces apoptosis 
is one which results in about 2 to 50 fold, preferably about 5 to 50 fold, and most preferably about 10 to 50 fold, 

20 induction of annexin binding relative to untreated cell in an annexin binding assay. 

Antibody "effector functions * refer to those biological activities attributable to the Fc region (a native 
sequence Fc region or amino acid sequence variant Fc region) of an antibody, and vary with the antibody 
isotype. Examples of antibody effector functions include: Clq binding and complement dependent cytotoxicity; 
Fc receptor binding; antibody-dependent cell-mediated cytotoxicity (ADCC); phagocytosis; down regulation of 

25 cell surface receptors (e.g., B cell receptor); and B cell activation. 

"Antibody-dependent cell-mediated cytotoxicity" or "ADCC" refers to a form of cytotoxicity in which 
secreted Ig bound onto Fc receptors (FcRs) present on certain cytotoxic cells (e.g., Natural Killer (NK) cells, 
neutrophils, and macrophages) enable these cytotoxic effector cells to bind specifically to an antigen-bearing 
target cell and subsequently kill the target cell with cytotoxins. The antibodies "arm" the cytotoxic cells and 

30 are absolutely required for such killing. The primary ceUs for mediating ADCC, NK cells, express Fc yRIII 
only, whereas monocytes express FcyRI, FcyRII and FcyRIII. FcR expression on hematopoietic cells is 
summarized in Table 3 on page 464 of Ravetch and Kinet, Armu. Rev. Immunol. 9:457-92 (1991). To assess 
ADCC activity of a molecule of interest, an in vitro ADCC assay, such as that described in US Patent No. 
5,500,362 or 5,821,337 may be performed. Useful effector cells for such assays include peripheral blood 

35 mononuclear cells (PBMC) and Natural Killer (NK) cells. Alternatively, or additionally, ADCC activity of the 
molecule of interest may be assessed in vivo, e.g., in a animal model such as that disclosed in Clynes et al. 
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(USA) 95:652-656 (1998). 

"Fc receptor" or "FcR* describes a receptor that binds to the Fc region of an antibody. The preferred 
FcR is a native sequence human FcR. Moreover, a preferred FcR is one which binds an IgG antibody (a gamma 
receptor) and includes receptors of the FcyRI, FcyRII and FcyRIII subclasses, including allelic variants and 
alternatively spliced forms of these receptors. FcyRII receptors include FcyRHA (an "activating receptor") and 
FcyRIIB (an "inhibiting receptor"), which have similar amino acid sequences that differ primarily in the 
cytoplasmic domains thereof. Activating receptor Fc yRIIA contains an immunoreceptor tyrosine-based 
activation motif (ITAM) in its cytoplasmic domain. Inhibiting receptor FcyRIIB contains an immunoreceptor 
tyrosine-based inhibition motif (IT1M) in its cytoplasmic domain, (see review M. in Daeron, Annu. Rev. 
Immunol. 15:203-234 H99TO. FcRs are reviewed in Ravetch and Kinet, Aimn. Rev Immunol. 9:457-492 
(1991); Capel et al., Trnmimnmethnds 4:25-34 (1994); and de Haas et al., J. Lab. Clin. Med. 126:330-41 
(1995). Other FcRs, including those to be identified in the future, are encompassed by the term "FcR" herein. 
The term also includes the neonatal receptor, FcRn, which is responsible for the transfer of maternal IgGs to 
the fetus (Guyer et al., J. Immunol. 117:587 (1976) and Kim et al., J. Immunol. 24:249 (1994)). 

"Human effector cells " are leukocytes which express one or more FcRs and perform effector functions. 
Preferably, the cells express at least Fc yRIII and perform ADCC effector function. Examples of human 
leukocytes which mediate ADCC include peripheral blood mononuclear cells (PBMC), natural killer (NK) cells, 
monocytes, cytotoxic T cells and neutrophils; with PBMCs and NK cells being preferred. The effector cells may 
be isolated from a native source, e.g., from blood. 

"Complement dependent cytotoxicity" or "CDC" refers to the lysis of a target cell in the presence of 
complement. Activation of the classical complement pathway is initiated by the binding of the first component 
of the complement system (Clq) to antibodies (of the appropriate subclass) which are bound to their cognate 
antigen. To assess complement activation, a CDC assay, e.g., as described in Gazzano-Santoro et al., L 
Immunol. Methods 202:163 (1996), may be performed. 

The terms "cancer" and "cancerous" refer to or describe the physiological condition in mammals that 
is typically characterized by unregulated cell growth. Examples of cancer include, but are not limited to, 
carcinoma, lymphoma, blastema, sarcoma, and leukemia or lymphoid malignancies. More particular examples 
of such cancers include squamous cell cancer (e.g., epithelial squamous cell cancer), lung cancer including 
small-cell lung cancer, non-small cell lung cancer, adenocarcinoma of the lung and squamous carcinoma of the 
lung, cancer of the peritoneum, hepatocellular cancer, gastric or stomach cancer including gastrointestinal 
cancer, pancreatic cancer, glioblastoma, cervical cancer, ovarian cancer, liver cancer, bladder cancer, cancer 
of die urinary tract, hepatoma, breast cancer, colon cancer, rectal cancer, colorectal cancer, endometrial or 
uterine carcinoma, salivary gland carcinoma, kidney or renal cancer, prostate cancer, vulval cancer, thyroid 
cancer, hepatic carcinoma, anal carcinoma, penile carcinoma, melanoma, multiple myeloma and B-cell 
lymphoma, brain, as well as head and neck cancer, and associated metastases. 

The terms "cell proliferative disorder" and "proliferative disorder* refer to disorders that are 
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associated with some degree of abnormal cell proliferation. In one embodiment, the cell proliferative disorder 
is cancer. 

"Tumor", as used herein, refers to all neoplastic cell growth and proliferation, whether malignant or 
benign, and all pre-cancerous and cancerous cells and tissues. 

An antibody, oligopeptide or other organic molecule which "induces cell death" is one which causes 
a viable cell to become nonviable. The cell is one which expresses a TAT polypeptide, preferably a cell that 
overexpresses a TAT polypeptide as compared to a normal cell of the same tissue type. The TAT polypeptide 
may be a transmembrane polypeptide expressed on the surface of a cancer cell or may be a polypeptide that is 
produced and secreted by a cancer cell. Preferably, the cell is a cancer cell, e.g., a breast, ovarian, stomach, 
endometrial, salivary gland, lung, kidney, colon, thyroid, pancreatic or bladder cell. Cell death in vitro may 
be determined in the absence of complement and immune effector cells to distinguish cell death induced by 
antibody-dependent cell-mediated cytotoxicity (ADCC) or complement dependent cytotoxicity (CDC). Thus, 
(he assay for cell death may be performed using heat inactivated serum (i.e. , in the absence of complement) and 
in die absence of immune effector cells. To determine whether the antibody, oligopeptide or other organic 
molecule is able to induce cell death, loss of membrane integrity as evaluated by uptake of propidium iodide 
(PI), trypan blue (see Moore et aL Cvtotechnotogy 17:1-11 (1995)) or 7AAD can be assessed relative to 
untreated cells. Preferred cell death-inducing antibodies, oligopeptides or other organic molecules are those 
which induce PI uptake in the PI uptake assay in BT474 cells. 

A w TAT-expressing cell' 7 is a cell which expresses an endogenous or transfected TAT polypeptide 
either on the cell surface or in a secreted form. A "TAT-expressing cancer" is a cancer comprising cells that 
have a TAT polypeptide present on the cell surface or that produce and secrete a TAT polypeptide. A "TAT- 
expressing cancer" optionally produces sufficient levels of TAT polypeptide on die surface of cells thereof, such 
that an anti-TAT antibody, oligopeptide ot other organic molecule can bind thereto and have a therapeutic effect 
with respect to the cancer. In another embodiment, a "TAT-expressing cancer* optionally produces and 
secretes sufficient levels of TAT polypeptide, such that an anti-TAT antibody, oligopeptide ot other organic 
molecule antagonist can bind thereto and have a therapeutic effect with respect to the cancer. With regard to 
the latter, the antagonist may be an antisense oligonucleotide which reduces, inhibits or prevents production and 
secretion of die secreted TAT polypeptide by tumor cells. A cancer which "overexpresses" a TAT polypeptide 
is one which has significantly higher levels of TAT polypeptide at the cell surface thereof, or produces and 
secretes, compared to a noncancerous cell of the same tissue type . Such overexpression may be caused by gene 
amplification or by increased transcription or translation. TAT polypeptide overexpression may be determined 
in a diagnostic or prognostic assay by evaluating increased levels of the TAT protein present on die surface of 
a cell, or secreted by die cell (e.g., via an uumunohistochemistry assay using anti-TAT antibodies prepared 
against an isolated TAT polypeptide which may be prepared using recombinant DNA technology from an 
isolated nucleic acid encoding the TAT polypeptide; FACS analysis, etc.). Alternatively, or additionally, one 
may measure levels of TAT polypeptide-encoding nucleic acid or mRNA in the cell, e.g., via fluorescent in situ 
hybridization using a nucleic acid based probe corresponding to a TAT-encoding nucleic acid or the complement 
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thereof; (FISH; see W098/45479 published October, 1998), Southern blotting, Northern blotting, orpolymerase 
chain reaction (PCR) techniques, such as real time quantitative PCR (RT-PCR). One may also study TAT 
polypeptide overexpression by measuring shed antigen in a biological fluid such as serum, e.g, using antibody- 
based assays (see also, e.g., U.S. Patent No. 4,933,294 issued June 12, 1990; WO91/05264 published April 
18, 1991; U.S. Patent 5,401,638 issued March 28, 1995; and Sias et al., J. Immunol Melinite 132:73-80 
(1990)). Aside from the above assays, various in vivo assays are available to the skilled practitioner. For 
example, one may expose cells within the body of the patient to an antibody which is optionally labeled with 
a detectable label, e.g. , a radioactive isotope, and binding of the antibody to cells in the patient can be evaluated, 
e.g., by external scanning for radioactivity or by analyzing a biopsy taken from a patient previously exposed 
to the antibody. 

As used herein, the term "immunoadhesin" designates antibody-like molecules which combine the 
banding specificity of a heterologous protein (an "adhesin") with the effector functions of immunoglobulin 
constant domains. Structurally, the immunoadhesins comprise a fusion of an amino acid sequence with the 
desired binding specificity which is other than the antigen recognition and binding site of an antibody (i.e., is 
"heterologous and an immunoglobulin constant domain sequence. The adhesin part of an immunoadhesin 
molecule typically is a contiguous amino acid sequence comprising at least die binding site of a receptor or a 
ligand. The immunoglobulin constant domain sequence in the immunoadhesin may be obtained from any 
innnunoglobulin, such as IgG-1, IgG-2, IgG-3, or IgG-4 subtypes, IgA (including IgA-1 and IgA-2), IgE, IgD 
or IgM. 

The word "label" when used herein refers to a detectable compound or composition which is conjugated 
direcdy or indirectly to the antibody, oligopeptide or other organic molecule so as to generate a "labeled" 
antibody, oligopeptide or other organic molecule. The label may be detectable by itself (e. g. radioisotope labels 
or fluorescent labels) or, in the case of an enzymatic label, may catalyze chemical alteration of a substrate 
compound or composition which is detectable. 

The term "cytotoxic agent" as used herein refers to a substance that inhibits or prevents the function 
of cells and/or causes destruction of cells. The term is intended to include radioactive isotopes (e.g., ?ti f I 131 , 
I 125 , Y 90 , Re 186 , Re 188 , Sm 153 , Bi 212 , P 32 and radioactive isotopes of Lu), cheinotherapeutic agents e.g. 
methotrexate, adriamicin, vinca alkaloids (vincristine, vinblastine, etoposide), doxorubicin, melphalan, 
mitomycin C, chlorambucil, daunorubicin or other intercalating agents, enzymes and fragments thereof such 
as nucleolytic enzymes, antibiotics, and toxins such as small molecule toxins or enzymatically active toxins of 
bacterial, fungal, plant or animal origin, including fragments and/or variants thereof, and die various antitumor 
or anticancer agents disclosed below. Other cytotoxic agents are described below. A tumoricidal agent causes 
destruction of tumor cells. 

A "growth inhibitory agent" when used herein refers to a compound or composition which inhibits 
growth of a cell, especially a TAT-expressing cancer cell, either in vitro or in vivo. Thus, the growth inhibitory 
agent may be one which significantly reduces die percentage of TAT-expressing cells in S phase. Examples of 
growth inhibitory agents include agents that block cell cycle progression (at a place other than S phase), such 
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as agents that induce Gl arrest and M-phase arrest. Classical M-phase blockers include the vincas (vincristine 
and vinblastine), taxanes, and topoisomerase II inhibitors such as doxorubicin, epirubicin, daunorubicin, 
etoposide, and bleomycin. Those agents that arrest Gl also spill over into S-phase arrest, for example, DNA 
alkylating agents such as tamoxifen, prednisone, dacarbazine, mechlorethamine, cisplatin, methotrexate, 5- 
fluorouracil, and ara-C. Further information can be found irfThe Molecular Basis of Cancer. Mendelsohn and 
5 Israel, eds., Chapter 1, entitled "Cell cycle regulation, oncogenes, and antineoplastic drugs" by Murakami et 
al. (WB Saunders: Philadelphia, 1995), especially p. 13. The taxanes (paclitaxel and docetaxel) are anticancer 
drugs both derived from the yew tree. Docetaxel (TAXOTERE® Rhone-Poulenc Rorer), derived from the 
European yew, is a semisynthetic analogue of paclitaxel (TAXOL®, Bristol-Myers Squibb). Paclitaxel and 
docetaxel promote the assembly of microtubules from tubulin dimers and stabilize microtubules by preventing 

1 0 depolymerization, which results in the inhibition of mitosis in cells. 

"Doxorubicin" is an anthracycline antibiotic. The full chemical name of doxorubicin is (8S~cis)-10-[(3- 
amirio-2,3,6-trideoxy«^ 
methoxy-5,12-naphtharenedione. 

The term "cytokine" is a generic term forproteins released by one cell population which act on another 

15 cell as intercellular mediators. Examples of such cytokines are lymphokines, monokines, and traditional 

polypeptide hormones. Included among the cytokines are growth hormone such as human growth hormone, N- 
methionyi human growth hormone, and bovine growth hormone; parathyroid hormone; thyroxine; insulin; 
proinsulin; relaxin; prorelaxin; glycoprotein hormones such as follicle stimulating hormone (FSH), thyroid 
stimulating hormone (TSH), and luteinizing hormone (LH); hepatic growth factor; fibroblast growth factor; 

20 prolactin; placental lactogen; tumor necrosis factor-a and -P; mullerian-inhibiting substance; mouse 
gonadotropin-associated peptide; inhibin; activin; vascular endothelial growth factor; integrin; thrombopoietin 
(TPO); nerve growth factors such as NGF-p; platelet-growth factor; transforming growth factors (TGFs) such 
as TGF-a and TGF- p; insulin-lifee growth factor-I and -II; erythropoietin (EPO); osteoinductive factors; 
interferons such as interferon -a, -p, and -y; colony stimulating factors (CSFs) such as macrophage-CSF (M- 

25 CSF); granulocyte-macrophage-CSF (GM-CSF); andgranulocyte-CSF (G-CSF); interleukins (ELs) such as EL-1, 
n> la, IL-2, IL-3, IL-4, IL-5, IL-6, H^7, IL-8, IL-9, IL-12; a tumor necrosis factor such as TNF-ot 

or TNF-B; and other polypeptide factors including LIF and kit ligand (KL) . As used herein, the term cytokine 
includes proteins from natural sources or from recombinant cell culture and biologically active equivalents of 
the native sequence cytokines. 

30 
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The term "package insert" is used to refer to instructions customarily included in commercial packages 
of therapeutic products, that contain information about the indications, usage, dosage, administration, 
contraindications and/or warnings concerning the use of such therapeutic products. 
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Table 1 

/* ; 
* 

* C-C increased from 12 to 15 

* Z is average of EQ 
5 * Bis average of ND 

* match with stop is _M; stop-stop = 0; J (joker) match = 0 
*/ 

#deflne _M -8 /* value of a match with a stop */ 
10 hit day[26][26] = { 

/* a'bcdefghijklmnopqrstuvwxyz*/ 

/* A */ { 2, 0,-2, 0, 0,-4. 1,-1,-1. 0.-1.-2,-1, 0,_M. 1, 0,-2, 1. 1, 0, 0,-6, 0,-3, 0}, 

/* B */ { 0, 3,-4, 3, 2,-5. 0, 1.-2, 0, 0,-3,-2, 2,_M.-1, 1, 0, 0, 0, 0,-2,-5, 0,-3. 1}, 

/* C */ {-2,-4,15,-5.-5,-4,-3,-3,-2, 0,-5,-6,-5,-4._M.-3.-5.-4. 0,-2, 0,-2,-8, 0, 0,-5}, 

15 /* D */ {0, 3,-5, 4, 3,-6, 1. 1,-2, 0. 0,4,-3, 2,_M,-1, 2,-1, 0. 0, 0,-2,-7, 0,-4, 2}, 

/* E */ { 0, 2.-5, 3. 4,-5, 0, 1,-2, 0. 0,-3.-2. l._M.-l. 2.-1. 0. 0, 0,-2,-7, 0,-4, 3}, 

/* p */ {-4,-5.-4,-6,-5. 9,-5,-2. 1. 0,-5, 2, 0,-4,_M.-5,-5.-4,-3.-3, 0,-1, 0, 0, 7,-5}, 

/* G */ { 1, 0,-3. 1. 0,-5, 5,-2,-3, 0.-2,-4,-3, 0,_M.-l,-l.-3, 1, 0, 0.-1.-7. 0.-5. 0}. 

/* H ♦/ {-1. 1.-3. 1. 1,-2.-2, 6.-2. 0. 0,-2,-2, 2,_M. 0, 3, 2.-1.-1, 0.-2,-3, 0, 0. 2} 

20 /*I*/ {-1,-2.-2,-2.-2. 1,-3.-2, 5, 0.-2, 2, 2,-2,_M,-2,-2,-2,-l, 0.0. 4.-5. 0.-1.-2}. 

/*J*/ {0,0,0,0.0,0.0.0.0.0.0.0.0,0,_M, 0.0, 0,0, 0,0, 0,0, 0,0,0}, 

/* K */ {-1, 0.-5, 0. 0.-5.-2. 0,-2, 0, 5,-3, 0. 1,_M.-1, 1. 3, 0. 0. 0.-2,-3. 0.-4. 0} 

I* L ♦/ {-2,-3,-6,-4.-3, 2.-4,-2. 2, 0.-3. 6. 4,-3,_M,-3,-2,-3,-3,-l, 0, 2,-2. 0,-1,-2}, 

/* M */ {-1,-2,-5.-3.-2. 0.-3,-2. 2, 0. 0. 4, 6,-2,_M,-2.-l, 0,-2.-1, 0, 2,4. 0,-2,-1}, 

25 /* N */ {0, 2.4, 2. 1,4. 0. 2.-2, 0. 1.-3.-2. 2,_M.-1, 1. 0, 1, 0, 0.-2.4 0,-2. 1}. 

1*0*1 { M M,_M,_M,_M._M,_M,_M, M,_M,_M,_M,_M,_M, 0,_M._M._M,_M,_M,_M._M._M,_M,_M._B*. 

/* P */ { 1,-1,-3,-1,-1,-5,-1, 0.-2, 0.-l,-3.-2,-l,_M. 6. 0. 0. 1, 0, 0.-1,-6, 0,-5, 0}, 

/* Q */ { 0, 1.-5. 2, 2.-5.-1. 3,-2, 0. 1,-2.-1. l._M. 0. 4. 1,-1.-1, 0.-2,-5, 0,4. 3}, 

/* R »/ {-2, 0,4,-1.-1,4,-3, 2.-2, 0, 3,-3. 0, 0._M, 0. 1. 6, 0,-1, 0.-2, 2, 0,4. 0}, 

30 /* S */ { 1. 0. 0. 0. 0.-3. 1.-1.-1, 0, 0,-3,-2, l._M, 1,-1, 0, 2, 1, 0,-1,-2. 0,-3, 0}, 

/* T */ { 1, 0.-2, 0. 0,-3, 0,-1, 0, 0. 0,-1.-1. 0,_M, 0,-1,-1. 1. 3, 0. 0,-5. 0,-3, 0}, 

/* U */ { 0, 0. 0, 0. 0, 0. 0. 0, 0. 0. 0, 0, 0, 0,_M. 0, 0. 0, 0, 0, 0, 0, 0, 0, 0, 0}, 

/* V */ { 0,-2,-2,-2,-2,-1.-1,-2. 4. 0,-2, 2. 2,-2,_M,-l .-2,-2.-1. 0, 0, 4,-6. 0.-2,-2}. 

/* W */ {-6,-5,-8.-7,-7. 0.-7,-3.-5, 0.-3.-2,4.4,_M,-6,-5, 2,-2,-5, 0,-6,17, 0, 0,-6}, 

35 /* X */ { 0. 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,_M. 0, 0. 0, 0. 0, 0, 0, 0. 0, 0. 0}. 

/* Y */ {-3,-3, 0.4,4. 7.-5. 0.-1. 0.4.-l,-2.-2,_M.-5,4.4.-3,-3, 0,-2. 0, 0,10,4}. 

/* Z */ { 0, 1.-5, 2. 3,-5. 0, 2.-2, 0. 0,-2,-1. 1,_M, 0, 3, 0, 0. 0, 0.-2.-6, 0,4. 4} 



40 



45 



50 



}; 
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Table 1 fcont'l 



/* 
*/ 



tfincluae <stdio.n> 






^include <ctype.h> 






#define MAXJMP 


16 


/* max jumps in a diag */ 


#define MAXGAP 


24 


/* don't continue to penalize gaps larger than this */ 


#aeOne JMPS 


1024 


/* max jmps in an path */ 


#define MX 


4 


/* save if there's at least MX-1 bases since last jmp */ 


#define DMAT 


3 


/* value of matching bases */ 


Mefine DMIS 


0 


/* penalty for mismatched bases */ 


#defme DINSO 


8 


/* penalty for a gap */ 


^define DINS1 


1 


/* penalty per base */ 


^define PINSO 


8 


/* penalty for a gap */ 


#define PINS1 


4 


/* penalty per residue */ 


struct jmp { 







10 



15 



short n[MAXJMP]; /* size of jmp (neg for ddy) */ 
20 unsigned short x[MAXJMPJ; /* base no. of jmp in seq x */ 

}; /* limits seq to 2*16-1*/ 

struct diag { 

fart score; /* score at last jmp */ 

25 long ofifeet; /* ofifeet of prev block*/ 

short ijmp; /* current jmp index */ 

struct jmp jp; /* list of jmps */ 

}; 

30 struct path { 

int spc; /* number of leading spaces */ 

short n[JMPS];/* size of jmp (gap) */ 

int x[JMPS];/* loc of jmp (last elem before gap) */ 

35 * 



40 int 



45 



50 



char 




♦ofile; 


/* output file name */ 


char 




*namex[2]; 


/* seq names: getseqsO */ 


char 




*prog; 


/* prog name for err msgs */ 


char 




*seqx[2]; 


/* seqs: getseqsO */ 






dmax; 


/* best diag: nwO */ 


int 




dmaxO; 


/* final diag */ 


int 




dna; 


/* set if dna: mainO */ 


int 




endgaps; 


/* set if penalizing end gaps */ 


int 




gapx, gapy; 


/* total gaps in seqs */ 


int 




lenO, leal; 


/* seq lens *J 


int 




ngapx, ngapy; 


/* total size of gaps */ 


int 




smax; 


/* max score: nwO */ 


int 




*xbm; 


/* bitmap for matching */ 


long 




ofi&et; 


/* current offset in jmp file */ 


struct 


diag 


*dx; 


y* holds diagonals */ 


struct 


path 


pp[2]; 


/* holds path for seqs */ 


char 




♦callocO, *mallocO, *indexO, *strcpy0; 


char 




♦getseqQ, *g_callocO; 
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Table 1 (conn 

/* Needleman-Wunsch alignment program 
* 

* usage: progs filel file2 

* where filel and file2 are two dna or two protein sequences. 

5 * The sequences can be in upper- or lower-case an may contain ambiguity 

* Any lfres beginning with ';\ or '<' are ignored 

* Max file length is 65535 (limited by unsigned short x in the jmp struct) 

* A sequence with 1/3 or more of its elements ACGTU is assumed to be DNA 

* Output is in the file "align.out" 
10 * 

* The program may create a tmp file in /tmp to hold info about traceback. 

* Original version developed under BSD 4.3 on a vax 8650 
*/ 

^include "nw.1T 
15 #include "day.h" 

static _dbval[26] = { 

1,14,2,13,0,0,4,11,0,0,12,0,3,15,0,0,0,5,6,8,8,7,9,0,10,0 

20 * 

static jpbval[26] = { 

1, 2|(1 < <('D'-'A'))|(1< <('N'-'A')), 4, 8, 16, 32, 64, 
128, 256, OxFFFFFFF, 1< < 10, 1<<11, 1 < < 12, 1<<13, 1<<14, 
1<<15, 1<<16, 1<<17, 1<<18, 1<<19, 1< <20, 1<<21, 1<<22, 
25 1<<23, 1<<24, 1<<25|(1<<( , E'- , A , ))|(1<<CQ , - , A')) 

}; 

mam(ac, av) main 
int ac; 
30 cfaar *avQ; 

{ 

prog = av[0]; 
if(ac!=3){ 

j|>rintf(stderr, "usage: %s filel file2\n", prog); 
35 r>rintf(stderr, "where filel and file2 are two dna or two protein sequences.\n"); 

rprintf([stderr ? n The sequences can be in upper- or lower-case\n"); 

n>rintJ0[stderr,"Any lines beginning with V or * < 1 are ignored\n"); 

Q>rintf(stderr,"OutoM is in the file \ "align. out\"uT); 

exit(l); 

40 } 

namexfl)] = av[l]; 

namexfl] = av[2]; 

seqxR)] = getseq(namex[0], &len0); 

seqx[l] = getseq(namex[l], &Ienl); 
45 xbm = (dna)? dbval : j>bval; 

endgaps = 0; /* 1 to penalize endgaps */ 

ofile = "alignuout"; /* output file */ 

50 nwQ; /* fill in the matrix, get the possible jmps */ 

readjmpsO; /* get the actual jmps */ 

printO; /* print stats, alignment */ 

cleanup(0); /* unlink any tmp files */} 
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Table 1 (conf) 

/* do the alignment, return best score: mainO 

* dna: values in Fitch and Smith, PNAS, 80, 1382-1386, 1983 

* pro: PAM 250 values 

* When scores are equal, we prefer mismatches to any gap, prefer 

* a new gap to extending an ongoing gap, and prefer a gap in seqx 

* to a gap in seq y. 



*/ 






nwO 






{ 








char 


*px,*py; 




int 


♦ndely. *dely; 




int 


ndelx, delx; 




int 


*tmp; 




int 


mis; 




int 


insO, insl; 




register 


id; 




register 
register 
register 


ii; 

*coI0, *coll; 
xx, yy; 



nw 



10 char *px, *py; f* seqs and ptrs */ 

/* keep track of dely*/ 
/♦keep track of delx*/ 
/* for swapping rowO, rowl */ 
/* score for each type */ 

15 int insO, insl; /* insertion penalties */ 

/* diagonal index */ 
/* jmp index */ 
/* score for curr, last row */ 
/* index into seqs */ 

20 

dx = (struct diag ^gjcallocCto get diags", lenO-f lenl+1, sizeof(struct diag)); 
ndely = (int *)g_calloc("to get ndely\ lenl + 1, sizeorXint)); 
dely = (int *)gj*HocTto get dely % lenl + 1 , sizeof(int)); 
colO = (int *)gjcalloc("tD get colO", lenl+1, sizeof(int)); 
25 coll - (int *)g_caIloc("to get coir, lenl + 1, sizeoCCint)); 

insO = (dna)? DINSO : PINSO; 
insl = (ana)? DINS1 : PINS1; 
smax = -10000; 
if (endgaps) { 

30 for (col0[0] = dely(0] = -insO, yy = l;yy < = lent; yy++){ 

col0[yy] = dery[yy] «= col0[yy-l] - insl; 
ndelytyy] = yy; 

} 

col0[0] = 0; /* Waterman Bull Math Biol 84 */ 

35 } 

else 

for (yy = 1; yy <= lenl; yy++) 
derytyy] = -insO; 
/* fill in match matrix 

40 •/ 

for (px = seqx[0], xx = 1; xx <= IenO; px++, xx++) { 
/* initialize first entry in col 
*/ 

if (endgaps) { 
45 if(xx==l) 

coll[0] = delx = -(insO+insl); 



coil[0] = delx - col0[0] - insl; 
ndelx = xx; 



coll[0) = 0; 
delx « -insO; 
ndelx = 0; 



253 



1/24/2006, EAST Version: 2.0.1.4 



WO 2004/030615 



PCT/US2003/028547 



Table 1 (conV) 

...mv 

for (py = seqx[l], yy = 1; yy < = leal; py++, yy+ +) { 
mis = colO[yy-l]; 
if(dna) 

mis + = (xbmt^x-'AI&xbm^y-'A'F DMAT : DMIS; 
mis + - ^day[*px- t A , ][*py- , A , l; 

/* update penalty for del in x seq; 

* favor new del over ongong del 

* ignore MAXGAP if weighting endgaps 
*/ 

if (endgaps ) | ndely[yy] < MAXGAP) { 

if (coi0[yy] - insO > - dely[yy]) { 

delyfyy] = colO[yy] - (insO+insl); 
ndelyfyy] - 1; 

}dse{ 

defytyy] -« insl; 
ndely[yy]++; 

} 

}dse{ 

if (col0[yy] - (insO+insl) > = delyfyy]) { 
delyfyy] = coK)[yy] - (insO+insl); 
ndelyfyy] = 1; 

} else 

ndelytyy]++; 

> 

/* update penalty for del in y seq; 

* favor new del over ongong del 
*/ 

if (endgaps 1 1 ndelx < MAXGAP) { 

if (coll[yy-l] -insO >= delx) { 

delx = coll[yy-l] - (insO+insl); 
ndelx = 1; 

}else{ 

delx -= insl; 
ndelx++; 

} 

}e!se{ 

if (coll[yy-l] - (insO+insl) >= delx){ 
delx = coll[yy-l] - (insO+insl); 
ndelx = 1; 

} eke 

ndelx++; 

> 

/* pick the maximum score; we're favoring 

* mis over any del and delx over dely 
*/ 

id = xx- yy + lenl - 1; 
if (mis > = delx &&mis > = delyfyy]) 
collfyy] = mis; 
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Table 1 (conV) 

else if (delx > - dely[yy]) { 
colltyy] = delx; 
ij « dx[id].ijmp; 

if (dxjid] jp.n[0] && (Idna 1 1 (ndelx > = MAXJMP 
&&xx > dx{idJ.jp.x[ij]+MX) 1 1 mis > dx[id].score+DINSO)) { 
dx[id].ijnzp++; 
if >= MAXJMP) { 

writejmps(id); 
ij = dx[id]Jjmp = 0; 
dxpfl -offset - offset; 

offset += sizeof(stract jmp) + sizeo^offset); 

} 

> 

dx[id]jp.n[ij] = ndelx; 
dx[id].jp.x[ij] « xx; 
dx[id). score = delx; 

} 

colllyy] » dely[yy}; 
ij = dxfidl.ijmp; 
if(dxtid].jp.n[01 && (!dna 1 1 (ndely[yy] > = MAXJMP 

&& xx > dx[idj jp.xpj]+MX) | ) mis > dx[id].score+DINS0)) { 
dx[id].gmp+ + ; 
if (++8 >= MAXJMP) { 
writejmps(id); 
g « dx0d]ujmp = 0; 
dx[id]. offset = offset; 

ofifeet sizeof(structjmp) + sizeof(offeet); 

} 

> 

dx[idj.jp.n[ij] » -ndelyfyy]; 

dxpdl.jp.xnn = xx; 
dxftfl.score = dely[yy]; 

if (xx ==len0&&yy < lenl){ 
/♦last col 
1 

if (endgaps) 

colllyy] ins0+insl*(lenl-yy); 
if(coll[yy] > smax) { 

smax = coll[yy]; 

dmax = id; 

} 

} 

} 

if (endgaps && xx < lenO) 

coll[yy-l] ms0-finsl*(len0-xx); 
if (colllyy-l] > smax) { 

smax = colllyy-l]; 

dmax = id; 

tmp - colO; coH) = coll; coll = tmp; } 
(void) free((cfaar *)ndely); 
(void) freetfchar *)dely); 
(void)&ee((diar*)col0); 
(void)free((cliar*)coll); } 
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/* 

* printO - only routine visible outside this module 

5 * static: 

* getmatO — trace back best path, count matches: printO 

* pr_alignO — print alignment of described in array p[|: printO 

* dumpblockO — dump a block of lines with numbers, stars: pr_alignO 

* numsO - put out a number line: dumpblockO 

10 * putlineO ~ put out a line (name, [num], seq, [num]): dumpblockO 

* starsO - -put a line of stars: dumpblockO 

* strip nameO — strip any path and prefix from a seqname 
*/ 

15 #include "nw.h w 



20 



#deffne SPC 3 

^define PJJNE 256 /* maximum output line */ 

#derme P SPC 3 /* space between name or num and seq */ 

extern _day[26][26]; 

hit olen; /* set output line length */ 

FILE *rx; /* output file */ 



25 printO P rint 
{ 

int lx, ly, firstgap, lastgap; /* overlap */ 

if ((fe = fopen(ofile, "w")) == 0) { 
30 fprintJ9[stderr,"%s: can't write %sVn\ prog, ofile); 

cleanup(l); 

} 

fprintf(rx, "<rlrst sequence: %s (length « %d)\n", namex[0], lenO); 
rprintf(rx, *< second sequence: %s (length = %d)\n", namex[l], lenl); 
35 olen = 60; 

lx = lenO; 
ly = lenl; 

firstgap — lastgap = 0; 

if (dmax < lenl - 1) { /* l e adin g gap in x */ 
40 pp[0].spc = firstgap = lent - dmax - 1; 

ly -= pp[0].spc; 

} 

else if (dmax > lenl - 1) { /* leading gap in y */ 
pp[l].spc = firstgap « dmax - (lenl - 1); 
45 lx -= pp[l].spc; 

} 

if(dmaxO < lenO - 1) { /* trailing gap in x */ 
lastgap = lenO - dmaxO -1; 
lx-=l 

50 } 

else if (dmaxO > lenO - 1) { /* trailing gap in y */ 
lastgap = dmaxO - (lenO - 1); 
ly -« lastgap; 

} 

55 getmatOx, ly, firstgap, lastgap); 

pr_alignO; } 
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Table 1 (cont'> 

/* 

* trace back the best path, count matches 
*/ 

static 

5 getmat(lx, ly, firstgap, lastgap) getmat 
|nt lx, ly; /* "core" (minus endgaps) */ 

int firstgap, lastgap; /* leading trailing overlap */ 

int nm, iO, il, sizO, sizl; 

10 char outx[32]; 

double pet; 



nO t nl; 

register char *p0, *pl; 

/* get total matches, score 
15 */ 

iD = il = sizO = sizl = 0; 

pO = seqx[0] + pp[l].spc; 

pi = seqx[l] + pp[0].spc; 

nO -pp[l].spc + 1; 
20 nl = pp[0].spc + 1; 

nm = 0; 

while ( *p0 && *pl ) { 
if(srzO){ 

25 nl++; 

sizO-; 

} 

else if (sizl) { 

30 n0++; 

sizl-; 

} 

else { 

if (xbrnL^O-'A"]*^^!-'^]) 

35 

if(nO++ = = pp£0].x[iO]) 

sizO = pp[0].n[iO++]; 
if(al++ ==pp[1Lx[il]) 

sizl = pp[l].nDl++]; 

40 p0++; 

pi++; 



45 /* pet homology: 

* if penalizing endgaps, base is the shorter seq 
*else, knock off overhangs and take shorter core 
*/ 

if (endgaps) 

50 lx = OenO < lenl)? lenO : lenl; 

else 

lx = (U < ly)? lx : ly; 
pet = 100.*(double)nm/(double)lx; 
tprintfiCrx, "\n"); 

55 rprintf(rx, * < S&d matches in an overlap of %± % .2f percent simila^ity\n , ' , 

nm, (nm = = 1)? : n es\ lx. pet); 
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fprintf(fx, " < gaps in first sequence: %d" , gapx); • • • getmat 

if(gapx){ 

(void) sprintfftutx, ■ (%d %s%s) n , 

ngapx, (dna)? "base": "residue", (ngapx ~ 1)? MW : V); 
5 fprintf(fx,"%s\outx); 

fprintf(fx, \ gaps in second sequence: %d", gapy); 
if(gapy){ 

(void) sprintf(autx, " (%d %s%s) n 9 

ngapy, (dna)? "base" :"residue\ (ngapy 1)? ":V); 
10 lprinrf(rV%s\ outx); 

} 

if (dna) 

fprintf(fx, 

"\n<score: %d (match = %d, mismatch = %d, gap penalty = %d+ %d per base)\n\ 
15 smax, DMAT, DMIS, DINSO, DINSt); 

else 

iprintf(fx, 

"\n< score: %d (Dayhoff PAM 250 matrix, gap penalty = %d + %d per residue)\n", 
smax, PINSO, PINS1); 
20 if (endgaps) 

iprintf(fx, 

'*< endgaps penalized, ieftendgap: %d %s%s, right endgap: %d %s%s\n", 
firstgap, (dna)? "base" : "residue", (firstgap == 1)? "" : "s", 
lastgap, (dna)? "base" : "residue", (lastgap == 1)? : "s"); 

25 



fprintf(fx, " <endgaps not penalized\n"); 

} 

static nm; /* matches in core - for checking */ 



lmax; /* lengths of stripped file names */ 

30 static ijP]; /*jmp index for a path */ 

static nc[2]; /* number at start of current line */ 

nip]; /* current elem number — for gapping */ 



sizP]; 

static char *ps[2]; /* ptr to current element */ 

35 static char *J»P]; /* ptr to next output char slot */ 

static char outp][P_LINE]; /* output line */ 
static char star[P LINE]; /* set by starsO */ 
/* 

* print alignment of described in struct path ppQ 
40 */ 

static 

pr_alignO pralign 
{ 

int nn; /* char count */ 

45 int more; 

i; 



for (i = 0, lmax = 0; i < 2; i++) { 
nn = stripname(namex[f|); 
50 if (nn > lmax) 

lmax » nn; 
nc[i] - 1; 
m[il-l; 
siz[i] = ij[i] = 0; 
55 ps[i] = seqx[i]; 

pop] = outffl; 
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10 



15 



for (nn = nm = 0, more = 1; more; ) { 

for (i = more = 0; i < 2; i++) { 
/* 

* do we have more of mis sequence? 
*/ 

tf(!*ps[iD 

continue; 

more++; 

if (pp[i].spc) { /* leading space */ 
*po[i]++ = 1 '; 
pp[i].spc~; 



} 



iif(siz[i]){ /* in a gap*/ 
*poH++ - '-•; 
siz[i]-; 



..pr align 



20 



25 



30 



35 



40 



45 



50 



55 



} 



} 



/* we're putting a seq element 
*/ 

*po[i] = *ps[i]; 
ff(isIower(^s[iD) 

*ps[ij = toupper(*ps[i]); 

po[i]++; 
ps[i]++; 
/* 

* are we at next gap for this seq? 
*/ 

if(ni[il-=pp[i].x[ij[i]]){ 
f* 

* we need to merge all gap s 

* at this location 
*/ 

sizp] =pp[i].nDj[i] + +]; 
whfle (ni[i] == pp[i].x[ij[i]]) 

siz[i] +=ppDl.n[ijH++); 

> 

ni[i]++; 



if (++nn = = olen || !more&&nn) { 
dumpblockO; 
for(i = 0;i < 2; i++) 
pop] - out[i]; 

nn = 0; 

} 

} 

} 
/* 

* dump a block of lines, including numbers, stars: pr_align() 
*/ 

static 

dumpblockO 

{ 



dumpblock 



for(i=0;i < 2; i++) 
*po[0- - '\0'; 
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15 



20 



Table 1 (conn 

(void) putcOVa', fx); 
for(i = 0;i < 2; i++){ 

if (*out[i] && (*out[i] != " 1 1 *(po[i]) !=")){ 
if(i==0) 

aums(i); 
if(i ==0&&*out[l]) 

starsO; 
putline®; 

if(i === 0&&*out[i]) 

fprinttj(fx, star); 
if(i==l) 

nums(i); 



} 



> 



} 
/* 

* put out a number line: dumpblockQ 

*/ 

static 
nums(ix) 

int ix; /* index in outO holding seq line */ 



...dumpblock 



nums 



25 



30 
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char nlineJPLINE]; 
register i,j; 
register char *pn, *px, *py; 
for (pn = nline, i = 0; i < imax+P SPC; i++, pn++) 
*pn - ' 

for(i = nc[ix], py = outfix]; *py; py++, pn++) { 

if(*py || *py 

*pn = "; 



else{ 



if (i%10 == 0 || Ci == 1 &&nc[ix] != 1)) { 
j=(i<0)?-i:i; 
for (px = pn; j; j /- 10, px-) 
*px=j%10 + '0'; 

if (i < 0) 

*px = 



} 



} 

else 



*pn = 



} 

*pn = '\0'; 
nc[ix] = i; 

for (pn — nline; *pn; pn+ +) 
(void) putc(*pn, 6c); 
(void) putc('\n\ fit); 

} 
/* 

* put out a line (name, [sum], seq, [num]): dumpblockQ 
*/ 

static 
pufline(ix) 

bit ix; { 



putline 
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...putline 

int i; 
register char *px; 

for (px = namex[ix], i - 0; *px && *px ! - V; px+ + , i+ +) 

(void) putc(*px, fx); 
for (; i < imax+PJSPC; i++) 

(void)putoC 

/* these count from 1: 

* niO is current element (from 1) 

* ncQ is number at start of current line 
*/ 

for (px = outpx]; *px; px++) 

(void) putc(*px&0x7F, fx); 
(void)putc( , \n\ fie); 

} 



/* 

* put a line of stars (seqs always in out[0], out[l]): dumpblockQ 
*/ 

static 
starsO 
{ 

int i; 

register char *p0, *pl, cx, *px; 

if (!*out[01 1 1 (*out[0] = = " && *(po[0]) — ' ') 1 1 
!*out[l] 1 1 (*out[l] « ' * && *(po[l]) = = 1 '» 
return; 
px = star; 

for (i = Imax+PSPC; i; i-) 
*px+ + - ' 

for (pO = outlO], pi - outfl]; *p0 && *pl; p0++, pl + +) { 
if (isalpba(*p0) && isalpha(*pl)) { 

if (xbIn{*pO- , A , ]&xbm[*pl- , A , ]) { 
cx = '*•; 
nm++; 

Leif(!dna^ - aay[*pO- t A , ][*pl- , A , ] > 0) 
cx — . ; 

else 

cx="; 



*px+ + = cx; 

> 

*px++ » ^5 
*px = 'VO'; 



stars 
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/* 

* strip path or prefix from pn, return lem pr_align() 
*/ 

static 

stripname(pn) stripname 
char *pn; /* file name (may be path) */ 

{ 

register diar *px, *py; 



10 py = 0; 

for (px = pn; *px; px+ +) 
lf(*px== T) 

py - px + 1; 

if(py) 

15 (void) strcpy(pn, py); 

retum(strlen(pn)); 



20 
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/* 

* cleanupO — cleanup any tap file 

* getseqO — read in seq, set dna, len, marien 

* g callocO - callocO with error checkin 

* readjmpsO - get the goodjmps, from tap file if necessary 

* writejmpsO - write a filled array of jmps to a tap file: nwO 
*/ 

^include "nw.h" 
^include <sys/file.h> 



*jname = "/tmp/homgXXXXXX"; 
*5; 



char 
FILE 

int cleanupO; 
long lseefcQ; 
/* 

* remove any trap file if we blow 
*/ 

cleanup(i) 

int i; 

{ 

if (9) 

(void) unlinkdname); 

exit(i); 

} 

/* 

* read, return ptr to seq, set dna, lea, maxlen 

* skip lines starting with 1 '<', «'>• 

* seq in upper or lower case 
*/ 

char * 

getseq(fHe, len) 

char *file; /* file name */ 
int *lea; /* seq len*/ 



/♦trap file for jmps */ 
/* cleanup trap file */ 



{ 



cleanup 



getseq 



char line[1024], *pseq; 

register char *px. *py; 

int natgc, den; 

FILE *fp» 

if((fp = fopen(file,V)) — 0) { 

Q>rintf(stderr,"%s: can't read 9Ss\n", prog, file); 

ent(l); 

> 

den = natgc = 0; 

while (fgetsOine, 1024, tp)) { 

if(*line== ■;■ |) *line — '<' 1 1 *line — '>') 

continue; 
for (px = line; *px != V; px++) 

if (isupper(*px) 1 1 islower(*px)) 
tien++; 

if ((pseq = ma!loc((unsigned)(tlen+6))) == 0) { 

rprintftstderr/as: mallocO feiled to get %d bytes for %s\n\ prog, tlen+6, file); 
exit(l); 

pseq[0] - pseq[i] = pseq[2] = pseq[3] = *\0'; 
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...getseq 

py « pseq + 4; 
♦lea = tlen; 
rewind(rp); 

5 while (fgetsGine, 1024, fp)) { 

if(*line= = •;• j 1 *line == • <' 1 1 *line = = •>') 
continue; 

for (px = line; *px != '\n'; px++) { 
if (isupper(*px)) 

10 *py++ = *px; 

t else if (islower(*px)) 

*py++ = toupper(*px); 
if (index( n ATGCU\^-l))) 
natgc+ + ; 

15 } 
} 

*py++ = 'XQ 1 ; 
*py = no 4 ; 
(void) fclose(fp); 
20 dna = natgc > (tien/3); 

return(pseq+4); 

char * 

g^calloc(msg, nx, sz) g_callOC 
25 char *msg; /* program, calling routine */ 

int nx, sz; /* number and size of elements */ 

{ 

char *px, *calloc0; 

if ((px = calloc((unsigned)nx r (unsigned)sz)) == 0) { 
30 if(*msg){ 

r>rintf(stderr, w %s: &_calloc0 Med %s (n=%d, sz=%d)\n\ prog, msg, nx, sz); 

exit(l); 

} > 

35 return(px); 
} 

I* 

* get final jmps from dxQ or tmp file, setppQ, reset dmax: mainO 
40 */ 

readjmpso readjmps 

{ 

int fd = -1; 

int siz, iO, il; 

45 register i,j,xx; 

if(tj){ 

(void) fclose(fj); 

if ((fd « open(jname, 0_RDONLY, 0)) < 0) { 

r^rintf(stderr, "%s: can't openO %s\n\ prog, jname); 
50 cleanup(l); 

} 

} 

for (i = iO = il = 0, dmaxO = dmax, xx = lenO; ; i++) { 
while (1){ 

55 for (j = dx[dmax].ijmp; j > = 0 && dx[dmax].jp.xD] > => xx; j-) 
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...readjmps 

if (j < 0 && dx[dmax].offeet && fi) { 

(void) lseek(fd, dx[dmax]. offset, 0); 
(void) read(fa, (char *)&dx[dmax]jp» sizeof(struct jmp)); 
5 (void) read(fd, (char *)&dx[dmax]. offset, sizeof(dx[dmax]. offset)); 

dx[dmax].ijmp = MAXJMP-1; } 

else 

break; } 

if(i>=JMPS){ 

10 rprinrrXstrJerr, ff %s: too many gaps in alignment\n M , prog); 

cleanup(l); 

} 

if(j>=0){ 

siz = dx[dniax].jp.nQ]; 
15 xx = dxfdbraax] jp.xfl]; 

dmax += siz; 

if (siz < 0) { /* gap in second seq */ 

pp[l].n[it] - -siz; 
xx += siz; 

20 /*id = xx-yy + lenl - 1 */ 

pp[l].x[il] = xx- dmax + lenl - 1; 

gapy++; 

ngapy-= siz; 
/* ignore MAXGAP when doing endgaps */ 
25 siz = (-siz < MAXGAP 1 1 endgaps)? -siz : MAXGAP; 

ii++r 

} 

else if (siz > 0) { /* gap in first seq */ 
pp[0].n[i0] = siz; 

30 pp[0].x[i0] = xx; 

gapx++; 
ngapx + = siz; 
/* ignore MAXGAP when doing endgaps */ 

siz = (siz < MAXGAP 1 1 endgaps)? siz : MAXGAP; 
35 i0++; 

} 

} 

else 

break; 

40 } 

/* reverse the order of jmps */ 
for(j =0,i0-;j < i0;j ++,i0-){ 

i = pp[0].n[j]; pp[0].nO] = pp[0].n[i0]; pp[0).n[i0] = i; 

i = PP[0].xDl; pp[0].xD] = PPtO] .rfOl: PP[0].x[i0] - i; 

45 } 

for(j =0, il-;j < il;j++,il-){ 

i = pp[l].n|}]; pp[l].nDl = pp[l].n[il]; pp[l].n[il] = i; 
i = PPULxlj]; pp[l].xDl - pp[l].x[il]; pp[l].x[il] = i; 

50 if(fii> = 0) 

(void) close(M); 

if(«){ 

(void) unlink(jname); 
tj=0; 

55 offset = 0; 

> > 
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Table 1 (conV) 

/* 

* write a filled jmp struct offeet of the prev one (if any): nwO 

writejmps(ix) WlitejmpS 
int ix; 

{ 

char *mktempO; 
if(!rj){ 

if (mktempQname) < 0) { 

§>rintf(stderr, "%s: can't mktempO %s\n", prog, jname); 
cleanup(l); 

} 

if ((rj = fbpen(jname, "w")) == 0) { 

fjprintf(stderr, "%s: can't write %s\n\ prog, jname); 
exitfi); 

} 

} 

(void) fwrite((char *)&dxflxj.jp, sizeof(struct jmp), 1, rj); 
(void) fwrite((char *)&dxpx]-orrset, sizeof(dx[ix].offtet), 1, rj); 

} 
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Table 2 

TAT XXXXXXXXXXXXXXX (Length = 15 amino acids) 

Comparison Protein XXXXXYYYYYYY (Length = 12 amino acids) 

5 % amino acid sequence identity = 

(the number of identically matching amino acid residues between the two polypeptide sequences as determined 
by ALIGN-2) divided by (the total number of amino acid residues of the TAT polypeptide) = 

10 5 divided by 15 = 33.3% 

Table 3 

TAT XXXXXXXXXX (Length = 10 amino acids) 

15 Comparison Protein XXXXXYYYYYYZZYZ (Length = 15 amino acids) 

% amino acid sequence identity = 

(the number of identically matching amino acid residues between the two polypeptide sequences as determined 
20 by ALIGN-2) divided by (die total number of amino acid residues of the TAT polypeptide) = 

5dividedbyl0 = 50% 

Table 4 



25 



30 



TAT-DNA NNNNNNNNNNNNNN (Length = 14 nucleotides) 

Comparison DNA NNNNNNLLLLLLLLLL (Length = 16 nucleotides) 

% nucleic acid sequence identity = 

(the number of identically matching nucleotides between die two nucleic acid sequences as determined by 
ALIGN-2) divided by (the total number of nucleotides of the TAT-DNA nucleic acid sequence) = 
6 divided by 14 = 42.9% 
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Table 5 

TAT-DNA NNNNNNNNNNNN (Length = 12 nucleotides) 

Comparison DNA NNNNLLLVV (Length = 9 nucleotides) 

% nucleic acid sequence identity = 

(the number of identically matching nucleotides between the two nucleic acid sequences as determined by 
ALIGN-2) divided by (the total number of nucleotides of the TAT-DNA nucleic acid sequence) = 

4dividedbyl2 = 33.3% 

IL Compositions and Methods of the Invention 

A. Anti-TAT Antibodies 

In one embodiment, the present invention provides anti-TAT antibodies which may find use herein as 
therapeutic and/or diagnostic agents. Exemplary antibodies include polyclonal, monoclonal, humanized, 
bispecific, and heteroconjugate antibodies. 

1. Polyclonal Antibodies 

Polyclonal antibodies are preferably raised in animals by multiple subcutaneous (sc) or intraperitoneal 
(ip) injections of the relevant antigen and an adjuvant. It may be useful to conjugate the relevant antigen 
(especially when synthetic peptides are used) to a protein that is immunogenic in the species to be immunized. 
For example, the antigen can be conjugated to keyhole limpet hemocyanin (KLH), serum albumin, bovine 
thyroglobulin, or soybean trypsin inhibitor, using a bifunctional or derivatizing agent, e.g. , maleimidobenzoyl 
sulfosuccinimide ester (conjugation through cysteine residues), N-hydroxysuccinimide (through lysine residues), 
glutaraldehyde, succinic anhydride, SOCl 2 , or R 1 N=C==NR, where R and R 1 are different alkyl groups. 

Animals are immunized against the antigen, immunogenic conjugates, or derivatives by combining, 
e.g. , 100 ng or 5 jig of the protein or conjugate (for rabbits or mice, respectively) with 3 volumes of Freund's 
complete adjuvant and injecting the solution intradennally at multiple sites. One month later, the animals are 
boosted with 1/5 to 1/10 the original amount of peptide or conjugate in Freund's complete adjuvant by 
subcutaneous injection at multiple sites. Seven to 14 days later, the animals are bled and the serum is assayed 
for antibody titer. Animals are boosted until die titer plateaus. Conjugates also can be made in recombinant 
cell culture as protein fusions. Also, aggregating agents such as alum are suitably used to enhance the immune 
response. 

2. Monoclonal Antibodies 

Monoclonal antibodies may be made using the hybridoma method first described by Kohler et al., 
Nature. 256:495 (1975), or may be made by recombinant DNA methods (U.S. Patent No. 4,816,567). 

In the hybridoma method, a mouse or other appropriate host animal, such as a hamster, is immuniz ed 

268 



1/24/2006, EAST Version: 2.0.1.4 



WO 2004/030615 



PCT/US2003/028547 



as described above to elicit lymphocytes that produce or are capable of producing antibodies that will specifically 
bind to the protein used for immunization. Alternatively, lymphocytes may be immunized in vitro. After 
immunization, lymphocytes are isolated and then fused with a myeloma cell line using a suitable fusing agent, 
such as polyethylene glycol, to form a hybridoma cell (Goding, Monoclonal Antibodies: Principles and P r actice , 
pp.59-103 (Academic Press, 1986)). 

The hybridoma cells thus prepared are seeded and grown in a suitable culture medium which medium 
preferably contains one or more substances that inhibit the growth or survival of the unfused, parental myeloma 
cells (also referred to as fusion partner). For example, if the parental myeloma cells lack the enzyme 
hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the selective culture medium for the 
hybridomas typically will include hypoxanthine, aminopterin, and thymidine (HAT medium), which substances 
prevent the growth of HGPRT-deficient cells. 

Preferred fusion partner myeloma cells are those that fuse efficiently, support stable high-level 
production of antibody by the selected antibody-producing cells, and are sensitive to a selective medium that 
selects against die unfused parental cells. Preferred myeloma cell lines are murine myeloma lines, such as those 
derived from MOPC-21 and MPC-11 mouse tumors available from the Salk Institute Cell Distribution Center, 
San Diego, California USA, and SP-2 and derivatives e.g., X63-Ag8-653 cells available from the American 
Type Culture Collection, Manassas, Virginia, USA. Human myeloma and mouse-human heteromyeloma cell 
lines also have been described for the production of human monoclonal antibodies (Kozbor, J. Immunol., 
133:3001 (1984); andBrodeur et aL, Monoclonal Antibody Production Tech niques and Applications, pp. 51-63 
(Marcel Dekker, Inc., New York, 1987)). 

Culture medium in which hybridoma cells are growing is assayed for production of monoclonal 
antibodies directed against the antigen. Preferably, the binding specificity of monoclonal antibodies produced 
by hybridoma cells is determined by immunoprecipitation or by an in vitro binding assay, such as 
radioimmunoassay (RIA) or enzyme-linked immunosorbent assay (ELISA). 

The binding affinity of the monoclonal antibody can, for example, be determined by the Scatchard 
analysis described in Munson et al., Anal. Biochem.. 107:220 (1980). 

Once hybridoma cells that produce antibodies of the desired specificity, affinity, and/or activity are 
identified, the clones may be subcloned by limiting dilution procedures and grown by standard methods (Goding, 
Monoclonal Antibodies: Principles and Practice, pp.59-103 (Academic Press, 1986)). Suitable culture media 
for this purpose include, for example, D-MEM or RPMI-1640 medium. In addition, the hybridoma cells may 
be grown in vivo as ascites tumors in an animal e.g,, by i.p. injection of the cells into mice. 

The monoclonal antibodies secreted by the subclones are suitably separated from the culture medium, 
ascites fluid, or serum by conventional antibody purification procedures such as, for example, affinity 
chromatography (e.g M using protein A or protein G-Sepharose) or ion-exchange chromatography, 
hydroxylapatite chromatography, gel electrophoresis, dialysis, etc. 

DNA encoding the monoclonal antibodies is readily isolated and sequenced using conventional 
procedures (e.g. , by using oligonucleotide probes that are capable of binding specifically to genes encoding the 
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heavy and light chains of murine antibodies). The hybridoma cells serve as a preferred source of such DNA. 
Once isolated, the DNA may be placed into expression vectors, which are then transfected into host cells such 
as E. colt cells, simian COS cells, Chinese Hamster Ovary (CHO) cells, or myeloma cells that do not otherwise 
produce antibody protein, to obtain the synthesis of monoclonal antibodies in the recombinant host cells. Review 
articles on recombinant expression in bacteria of DNA encoding the antibody include Skerra et aL, Curr. 
5 Opinion in Immunol.. 5:256-262 (1993) and Pluckthun, Immunol. Revs. 130: 151-188 (1992). 

In a further embodiment, monoclonal antibodies or antibody fragments can be isolated from antibody 
phage libraries generated using the techniques described in McCafferty et aL, Nature. 348:552-554 (1990). 
Clacksonet aL, Nature. 352:624-628 (1991) and Marks et al., J. Mol. Biol.. 222:581-597 (1991) describe the 
isolation of murine and human antibodies, respectively, using phage libraries. Subsequent publications describe 
10 the production of high affinity (nM range) human antibodies by chain shuffling (Marks et al., Bio/Technology, 
10:779-783 (1992)), as well as combinatorial infection and in vivo recombination as a strategy for constructing 
very large phage libraries (Waterhouse et al. , Nuc. Acids. Res. 21:2265-2266 (1993)). Thus, these techniques 
are viable alternatives to traditional monoclonal antibody hybridoma techniques for isolation of monoclonal 
antibodies. 

15 The DNA that encodes the antibody may be modified to produce chimeric or fusion antibody 

polypeptides, for example, by substituting human heavy chain and light chain constant domain (C H and CJ 
sequences for the homologous murine sequences (U.S. Patent No. 4,816,567; and Morrison, et al., Proc. Natl 
Acad. Sci. USA. 81:6851 (1984)), or by fusing the immunoglobulin coding sequence with all or part of the 
coding sequence for a non-immunoglobulin polypeptide (heterologous polypeptide). The non-immunoglobulin 

20 polypeptide sequences can substitute for the constant domains of an antibody, or they are substituted for the 
variable domains of one antigen-combining site of an antibody to create a chimeric bivalent antibody comprising 
one antigen-combining site having specificity for an antigen and another antigen-combining site having 
specificity for a different antigen. 

3. Human and Humanized Antibodies 

25 The anti-TAT antibodies of the invention may further comprise humanized antibodies or human 

antibodies. Humanized forms of non-human (e.g., murine) antibodies are chimeric immunoglobulins, 
immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab') 2 or other antigen-binding 
subsequences of antibodies) which contain minimal sequence derived from non-human, immunoglobulin. 
Humanized antibodies include human immunoglobulins (recipient antibody) in which residues from a 

30 complementary determining region (CDR) of the recipient are replaced by residues from a CDR of a non-human 
species (donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and capacity. In 
some instances, Fv framework residues of the human immunoglobulin are replaced by corresponding non-human 
residues. Humanized antibodies may also comprise residues which are found neither in the recipient antibody 
nor in the imported CDR or framework sequences. In general, the humanized antibody will comprise 

35 substantially all of at least one, and typically two, variable domains, in which all or substantially all of the CDR 
regions correspond to those of a non-human immunoglobulin and all or substantially all of the FR regions are 
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those of a human immunoglobulin consensus sequence. The humanized antibody optimally also will comprise 
at least a portion of an immunoglobulin constant region (Fc), typically that of a human immunoglobulin [Jones 
et al., Nature, 321:522-525 (1986); Riechmann et al., Nature. 332:323-329 (1988); and Presta, Curr. Op. 
Struct. Biol.. 2:593-596 (1992)]. 

Methods for humanizing non-human antibodies are well known in the art. Generally, a humanized 
antibody has one or more amino acid residues introduced into it from a source which is non-human. These non- 
human amino acid residues are often referred to as "import" residues, which are typically taken from an 
"import" variable domain. Humanization can be essentially perfomed following the method of Winter and co- 
workers [Jones et al., Nature. 321:522-525 (1986); Riechmann et al., Nature. 332:323-327 (1988); Verhoeyen 
et aL, Science. 239: 1534-1536 (1988)], by substituting rodent CDRs or CDR sequences for the corresponding 
sequences of ahuman antibody. Accordingly, such "humanized" antibodies are chimeric antibodies (U.S. Patent 
No. 4,816,567), wherein substantially less than an intact human variable domain has been substituted by die 
corresponding sequence from a non-human species. In practice, humanized antibodies are typically human 
antibodies in which some CDR residues and possibly some FR residues are substituted by residues from 
analogous sites in rodent antibodies. 

The choice of human variable domains, both light and heavy, to be used in making the humanized 
antibodies is very important to reduce antigenicity and HAMA response (human anti-mouse antibody) when the 
antibody is intended for human therapeutic use. According to the so-called "best-fit" method, the sequence of 
the variable domain of a rodent antibody is screened against the entire library of known human variable domain 
sequences. The h uman V domain sequence which is closest to that of the rodent is identified and the human 
framework region (FR) within it accepted for the humanized antibody (Sims et al., J. Immunol. 151:2296 
(1993); Chothia et al., J. MoL Biol., 196:901 (1987)). Another method uses a particular framework region 
derived from the consensus sequence of all human antibodies of a particular subgroup of light or heavy chains. 
The same framework may be used for several different humanized antibodies (Carter et al., Proc. Nad. Acad. 
Sci. USA. 89:4285 (1992); Presta et al., J- Immunol. 151:2623 (1993)). 

It is further important that antibodies be humanized with retention of high binding affinity for die 
antigen and other favorable biological properties. To achieve this goal, according to a preferred method, 
humanized antibodies are prepared by a process of analysis of the parental sequences and various conceptual 
humanized products using three-dimensional models of the parental and humanized sequences. Three- 
dimensional immunoglobulin models are commonly available and are familiar to those skilled in the art. 
Computer programs are available which illustrate and display probable three-dimensional conformational 
structures of selected candidate immunoglobulin sequences. Inspection of these displays permits analysis of die 
likely role of the residues in the functioning of die candidate immunoglobulin sequence, i.e., the analysis of 
residues that influence the ability of the candidate immunoglobulin to bind its antigen. In this way, FR residues 
can be selected and combined from the recipient and import sequences so that the desired antibody 
characteristic, such as increased affinity for the target antigen(s), is achieved. In general, the hypervariable 
region residues are directly and most substantially involved in influencing antigen binding. 
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Various forms of a humanized anti-TAT antibody are contemplated. For example, the humanized 
antibody may be an antibody fragment, such as a Fab, which is optionally conjugated with one or more cytotoxic 
agent(s) in order to generate an immunoconjugate. Alternatively, the humanized antibody may be an intact 
antibody, such as an intact IgGl antibody. 

As an alternative to humanization, human antibodies can be generated. For example, it is now possible 
5 to produce transgenic animals (e.g., mice) that are capable, upon immunization, of producing a full repertoire 
of human antibodies in the absence of endogenous immunoglobulin production. For example, it has been 
described that the homozygous deletion of the antibody heavy-chain joining region (J H ) gene in chimeric and 
germ-line mutant mice results in complete inhibition of endogenous antibody production. Transfer of the human 
germ-line immunoglobulin gene array into such germ-line mutant mice will result in the production of human 

1 0 antibodies upon antigen challenge. See, e.g., Jakobovits et al. , Proc. Natl. Acad. Sci. USA, 90:2551 (1993); 
Jakobovits et al., Nature. 362:255-258 (1993); Bruggemann et al., Year in Immuno. 7:33 (1993); U.S. Patent 
Nos. 5,545,806, 5,569,825, 5,591,669 (all of GenFharm); 5,545,807; and WO 97/17852. 

Alternatively, phage display technology (McCafferty et al., Nature 348:552-553 [1990]) can be used 
to produce human antibodies and antibody fragments in vitro, from immunoglobulin variable (V) domain gene 

1 5 repertoires from unimmunized donors. According to this technique, antibody V domain genes are cloned in- 
frame into either a major or minor coat protein gene of a filamentous bacteriophage, such as M13 or fd, and 
displayed as functional antibody fragments on the surface of the phage particle. Because the filamentous particle 
contains a single-stranded DNA copy of the phage genome, selections based on the functional properties of the 
antibody also result in selection of the gene encoding the antibody exhibiting those properties. Thus, the phage 

20 mimics some of the properties of the B-cell. Phage display can be performed in a variety of formats, reviewed 
in, e.g., Johnson, Kevin S. and Chiswell, David J., Current Opinion in Structural Biology 3:564-571 (1993). 
Several sources of V-gene segments can be used for phage display. Clackson et al. Nature. 352:624-628 (1991) 
isolated a diverse array of anti-oxazolone antibodies from a small random combinatorial library of V genes 
derived from the spleens of immunized mice. A repertoire of V genes from unimmunized human donors can 

25 be constructed and antibodies to a diverse array of antigens (including self-antigens) can be isolated essentially 
following the techniques described by Marks etal., J. Mol. Biol. 222:581-597 (1991), or Griffith etal., EMBO 
L 12:725-734 (1993). See, also, U.S. Patent Nos. 5,565,332 and 5,573,905. 

As discussed above, human antibodies may also be generated by in vitro activated B cells (see U.S. 
Patents 5,567,610 and 5,229,275). 

30 4. Antibody fragments 

In certain circumstances there are advantages of using antibody fragments, rather than whole 
antibodies. The smaller size of the fragments allows for rapid clearance, and may lead to improved access to 
solid tumors. 

Various techniques have been developed for the production of antibody fragments. Traditionally, these 
35 fragments were derived via proteolytic digestion of intact antibodies (see, e. g. , Morimoto et al. , Journal of 
Biochemical and Biophysical Methods 24:107-117 (1992); and Brennan et al., Science. 229:81 (1985)). 
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However, these fragments can now be produced directly by recombinant host cells. Fab, Fv and ScFv antibody 
fragments can all be expressed in and secreted from E. cpli, thus allowing the facile production of large amounts 
of these fragments. Antibody fragments can be isolated from the antibody phage libraries discussed above. 
Alternatively, Fab'-SH fragments can be directly recovered from E. coli and chemically coupled to form F(ab') 2 
fragments (Carter et al., Bio/Technology 10:163-167 (1992)). According to another approach, F(ab') 2 
fragments can be isolated directly from recombinant host cell culture. Fab and F(ab'^ fragment with increased 
in vivo half-life comprising a salvage receptor binding epitope residues are described in U.S. Patent No. 
5,869,046. Other techniques for the production of antibody fragments will be apparent to the skilled 
practitioner. In other embodiments, the antibody of choice is a single chain Fv fragment (scFv). See WO 
93/16185; U.S. Patent No. 5,571,894; and U.S. Patent No. 5,587,458. Fv and sFv are the only species with 
intact combining sites that are devoid of constant regions; thus, they are suitable for reduced nonspecific binding 
during in vivo use. sFv fusion proteins may be constructed to yield fusion of an effector protein at either the 
amino or the carboxy terminus of an sFv. See Antibody Engineering, ed. Borrebaeck, supra. The antibody 
fragment may also be a "linear antibody", e.g. , as described in U.S. Patent 5,641,870 for example. Such linear 
antibody fragments may be monospecific or bispecific. 

5. Bispecific Antibodies 

Bispecific antibodies are antibodies that have binding specificities for at least two different epitopes. 
Exemplary bispecific antibodies may bind to two different epitopes of a TAT protein as described herein. Other 
such antibodies may combine a TAT binding site with a binding site for another protein. Alternatively, an anti- 
TAT arm may be combined with an arm which binds to a triggering molecule on a leukocyte such as a T-cell 
receptor molecule (e.g. CD3), or Fc receptors for IgG (FcyR), such as FcyRI (CD64), FcyRH (CD32) and 
FcyRHI (CD 16), so as to focus and localize cellular defense mechanisms to the TAT-expressing cell. Bispecific 
antibodies may also be used to localize cytotoxic agents to cells which express TAT. These antibodies possess 
a TAT-binding arm and an arm which binds the cytotoxic agent (e.g. , saporin, anti-interferon-a, vinca alkaloid, 
ricin A chain, methotrexate or radioactive isotope hapten). Bispecific antibodies can be prepared as full length 
antibodies or antibody fragments (e.g., F(ab , ) 2 bispecific antibodies). 

WO 96/16673 describes a bispecific anti-ErbB2/anti-FcYRIII antibody and U.S. Patent No. 5,837,234 
discloses a bispecific anti-ErbB2/anti-FcyRI antibody. A bispecific anti-ErbB2/Fc a antibody is shown in 
WO98/02463. U.S. Patent No. 5,821,337 teaches a bispecific anti-ErbB2/anti-CD3 antibody. 

Methods for making bispecific antibodies are known in the art. Traditional production of full length 
bispecific antibodies is based on the co-expression of two immunoglobulin heavy chain-light chain pairs, where 
the two chains have different specificities (Millstein et al. , Nature 305:537-539 (1983)). Because of the random 
assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) produce a potential 
mixture of 10 different antibody molecules, of which only one has the correct bispecific structure. Purification 
of the correct molecule, which is usually done by affinity chromatography steps, is rather cumbersome, and the 
product yields are low. Similar procedures are disclosed in WO 93/08829, and in Traunecker et al. , EMBO 
JL 10:3655-3659 (1991). 
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According to a different approach, antibody variable domains with the desired binding specificities 
(antibody-antigen combining sites) are fused to immunoglobulin constant domain sequences. Preferably, the 
fusion is with an Ig heavy chain constant domain, comprising at least part of the hinge, C H 2, and C H 3 regions. 
It is preferred to have the first heavy-chain constant region (C H 1) containing the site necessary for light chain 
bonding, present in at least one of the fusions. DNAs encoding the immunoglobulin heavy chain fusions and, 
if desired, the immunoglobulin light chain, are inserted into separate expression vectors, and are co-transfected 
into a suitable host cell. This provides for greater flexibility in adjusting the mutual proportions of the three 
•polypeptide fragments in embodiments When unequal ratios of the three polypeptide chains used in the 
construction provide the optimum yield of the desired bispecific antibody. It is, however, possible to insert the 
coding sequences for two or all three polypeptide chains into a single expression vector when the expression of 
at least two polypeptide chains in equal ratios results in high yields or when the ratios have no significant affect 
on the yield of the desired chain combination. 

In a preferred embodiment of this approach, the bispecific antibodies are composed of a hybrid 
immunoglobulin heavy chain with a first binding specificity in one arm, and a hybrid immunoglobulin heavy 
chain-light chain pair (providing a second binding specificity) in the other arm. It was found that this 
asymmetric structure facilitates the separation of the desired bispecific compound from unwanted 
immunoglobulin chain combinations, as the presence of an immunoglobulin ligjit chain in only one half of the 
bispecific molecule provides for a facile way of separation. This approach is disclosed in WO 94/04690. For 
further details of generating bispecific antibodies see, for example, Suresh et al., Methods in Enzvmology 
121:210 (1986). 

According to another approach described in U.S. Patent No. 5,731,168, the interface between a pair 
of antibody molecules can be engineered to Tnaximire the percentage of heterodimers which are recovered from 
recombinant cell culture. The preferred interface comprises at least a part of the C H 3 domain. In this method, 
one or more small amino acid side chains from the interface of the first antibody molecule are replaced with 
larger side chains (e.g., tyrosine or tryptophan). Compensatory "cavities" of identical or similar size to the 
large side chain(s) are created on the interface of Che second antibody molecule by replacing large amino acid 
side chains with smaller ones (e.g., alanine or threonine). This provides a mechanism for increasing the yield 
of the heterodimer over other unwanted end-products such as homodimers. 

Bispecific antibodies include cross-linked or "heteroconjugate" antibodies. For example, one of the 
antibodies in the heteroconjugate can be coupled to avidin, the other to biotin. Such antibodies have, for 
example, been proposed to target immune system cells to unwanted cells (U.S. Patent No. 4,676,980), and for 
treatment of HIV infection (WO 91/00360, WO 92/200373, and BP 03089). Heteroconjugate antibodies may 
be made using any convenient cross-linking methods. Suitable cross-linking agents are well known in the art, 
and are disclosed in U.S. Patent No. 4,676,980, along with a number of cross-linking techniques. 

Techniques for generating bispecific antibodies from antibody fragments have also been described in 
the literature. For example, bispecific antibodies can be prepared using chemical linkage. Brennanet al., 
Sqience 229:81 (1985) describe a procedure wherein intact antibodies are proteolytically cleaved to generate 
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FCab 1 ^ fragments. These fragments are reduced in the presence of the dithiol completing agent, sodium 
arsenite, to stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab 1 fragments 
generated are then converted to thionitrobenzoate (TNB) derivatives. One of the Fab'-TNB derivatives is then 
reconverted to the Fab'-thiol by reduction with mercaptoethylamine and is mixed with an equimolar amount of 
the other Fab'-TNB derivative to form the bispecific antibody. The bispecific antibodies produced can be used 
as agents for the selective immobilization of enzymes. 

Recent progress has facilitated the direct recovery of Fab'-SH fragments from £. colt which can be 
chemically coupled to form bispecific antibodies. Shalaby et al., J. Exp. Med. 175: 217-225 (1992) describe 
the production of a folly humanized bispecific antibody F(ab , ) 2 molecule. Each Fab 1 fragment was separately 
secreted from E. coli and subjected to directed chemical coupling in vitro to form the bispecific antibody. The 
bispecific antibody thus formed was able to bind to cells overexpressing the ErbB2 receptor and normal human 
T cells, as well as trigger the lytic activity of human cytotoxic lymphocytes against human breast tumor targets. 

Various techniques for making and isolating bispecific antibody fragments directly from recombinant 
cell culture have also been described. For example, bispecific antibodies have been produced using leucine 
zippers. Kostelny et al.,JJmmunoL 148(5): 1547-1 553 (1992).- The leucine zipper peptides from the Fos and 
Jun proteins were linked to the Fab 1 portions of two different antibodies by gene fusion. The antibody 
homodimers were reduced it the hinge region to form monomers and then re-oxidized to form the antibody 
heterodimere. This method can also be utilized for the production of antibody homodimers. The "diabody" 
technology described by Hollinger et al., Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an 
alternative mechanism for making bispecific antibody fragments. The fragments comprise a V H connected to 
a V L by a linker which is too short to allow pairing between the two domains on the same chain. Accordingly, 
the V H and V L domains of one fragment are forced to pair with the complementary V L and V H domains of 
another fragment, thereby forming two antigen-binding sites. Another strategy for making bispecific antibody 
fragments by the use of single-chain Fv (sFv) dimers has also been reported. See Gruber et al.,J. Immunol^ 
152:5368 (1994). 

Antibodies with more than two valencies are contemplated. For example, trispecific antibodies can 
be prepared. Tntt pt a1 , L Immunol. 147:60 (1991). 

6. Heteroconiupate Antibodies 

Heteroconjugate antibodies are also within the scope of the present invention. Heteroconjugate 
antibodies are composed of two covalently joined antibodies. Such antibodies have, for example, been proposed 
to target immune system cells to unwanted cells [U.S. Patent No. 4,676,980], and for treatment of HIV 
infection [WO 91/00360; WO 92/200373; EP 03089]. It is contemplated that the antibodies may be prepared 
in vitro using known methods in synthetic protein chemistry, including those involving crosslinking agents. For 
example, immunotoxins maybe constructed using a disulfide exchange reaction or by forming a thioether bond. 
Examples of suitable reagents for this purpose include iminothiolate and methyl-4-mercaptobutyrimidate and 
those disclosed, for example, in U.S. Patent No. 4,676,980. 

7. Multivalent Antibodies 
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A multivalent antibody may be internalized (and/or catabolized) faster than a bivalent antibody by a 
cell expressing an antigen to which the antibodies bind. The antibodies of the present invention can be 
multivalent antibodies (which are other than of the IgM class) with three or more antigen binding sites (e.g. 
tetravalent antibodies), which can be readily produced by recombinant expression of nucleic acid encoding the 
polypeptide chains of the antibody. The multivalent antibody can comprise a dimerization domain and three or 
5 more antigen binding sites. The preferred dimerization domain comprises (or consists of) an Fc region or a 
hinge region. In this scenario, the antibody will comprise an Fc region and three or more antigen binding sites 
ammo-terminal to the Fc region. The preferred multivalent antibody herein comprises (or consists of) three to 
about eigjht, but preferably four, antigen binding sites. The multivalent antibody comprises at least one 
polypeptide chain (and preferably two polypeptide chains), wherein the polypeptide chain(s) comprise two or 

10 more variable domains. For instance, the polypeptide chain(s) may comprise VD1-(X1) n -VD2-(X2) n -Fc, 
wherein VD1 is a first variable domain, VD2 is a second variable domain, Fc is one polypeptide chain of an 
Fc region, XI and X2 represent an amino acid or polypeptide, and n is 0 or 1. For instance, the polypeptide 
chain(s) may comprise: VH-CHl-flexible linker-VH-CHl-Fc region chain; or VH-CHl-VH-CHl-Fc region 
chain. The multivalent antibody herein preferably further comprises atleasttwo (and preferably four) light chain 

15 variable domain polypeptides. The multivalent antibody herein may, for instance, comprise from about two to 
about eight light chain variable domain polypeptides. The light chain variable domain polypeptides contemplated 
here comprise a light chain variable domain and, optionally, further comprise a CL domain. 
8. Effector Function Engineering 
It may be desirable to modify the antibody of the invention with respect to effector function, e.g., so 

20 as to enhance antigen-dependent cell-mediated cyotoxicity (ADCQ and/or complement dependent cytotoxicity 
(CDC) of the antibody. This may be achieved by introducing one or more amino acid substitutions in an Fc 
region of the antibody. Alternatively or additionally, cysteine residue(s) may be introduced in the Fc region, 
thereby allowing interchain disulfide bond formation in this region. The homodimeric antibody thus generated 
may have improved internalization capability and/or increased complement-mediated cell killing and antibody- 

25 dependent cellular cytotoxicity (ADCQ. See Caron et al. J. Exp Med. 176:1191-1195 (1992) and Shopes, B. 
I Tminimnl. 148:2918-2922 (1992). Homodimeric antibodies with enhanced anti-tumor activity may also be 
prepared using heterobifunctional cross-linkers as described in Wolff et al., Cancer Research 53:2560-2565 
(1993). Alternatively, an antibody can be engineered which has dual Fc regions and may thereby have enhanced 
complement lysis and ADCC capabilities. See Stevenson et al., Anti-Cancer Drug Design 3:219-230 (1989). 

30 To increase the serum half life of die antibody, one may incorporate a salvage receptor binding epitope 

into the antibody (especially an antibody fragment) as described in U.S. Patent 5,739,277, for example. As used 
herein, the term "salvage receptor binding epitope* refers to an epitope of the Fc region of an IgG molecule 
(e.g., IgGi, IgG 2 , IgG3, or IgG^ that is responsible for increasing the in vivo serum half-life of the IgG 
molecule. 

35 9. Immunoconjiiffates 

The invention also pertains to immunoconjugates comprising an antibody conjugated to a cytotoxic 
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agent such as a chemotherapeutic agent, a growth inhibitory agent, a toxin (e.g. , an enzymatically active toxin 
of bacterial, fungal, plant, or animal origin, or fragments thereof), or a radioactive isotope ( i.e., a 
radioconjugate). 

Chemotherapeutic agents useful in the generation of such immunoconjugates have been described 
above. Enzymatically active toxins and fragments thereof that can be used include diphtheria A chain, 
5 nonbinding active fragments of diphtheria toxin, exotoxin A chain (from Pseudomonas aeruginosa), ricin A 
chain, abrin A chain, modeccin A chain, alpha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytolaca 
americana proteins (PAPI, PAPII, and PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria 
officinalis inhibitor, gelonin, mitogellin, restrictocin, phenomycin, enomycin, and the tricothecenes. A variety 
of radionuclides are available for the production of radioconjugated antibodies. Examples include 212 Bi, l3l I, 

1 0 UI In, 90 Y, and I86 Re. Conjugates of the antibody and cytotoxic agent are made using a variety of bifunctional 
protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate (SPDP), iminothiolane (IT), 
bifunctional derivatives of imidoesters (such as dimethyl adipimidate HCL), active esters (such as disuccinimidyl 
suberate), aldehydes (such as glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) 
hexanediamine), bis-diazonium derivatives (such as bis-(p-diazomumberizoyl)^thylenedlamhie), diisocyanates 

15 (such as tolyene 2,6-diisocyanate), andbis-active fluorine compounds (such as l,5-difluoro-2,4-dinitrobenzene). 
For example, a ricin imrnunotoxin can be prepared as described in Vitetta et al. , Science. 238: 1098 (1987). 
Carbon-14-labeled l-isouiiocyariatobenzyl-3--methyldiethylene triaminepentaacetic acid (MX-DTPA) is an 
exemplary chelating agent for conjugation of radionucleotide to the antibody. See W094/1 1026. 

Conjugates of an antibody and one or more small molecule toxins, such as a calicheamicin, 

20 maytansinoids, a trichothene, and CC1065, and the derivatives of these toxins that have toxin activity, are also 
contemplated herein. 
Mavtansine and mavtansmoids 

In one preferred embodiment, an anti-TAT antibody (full length or fragments) of the invention is 
conjugated to one or more maytansinoid molecules. 

25 Maytansinoids are mitototic inhibitors which act by inhibiting tubulin polymerization. Maytansine was 

first isolated from the east African shrub Maytenus serrata (U.S. Patent No. 3,896,111). Subsequently, it was 
discovered that certain microbes also produce maytansinoids, such as maytansinol and C-3 maytansinol esters 
(U.S. Patent No. 4, 151,042). Synthetic maytansinol and derivatives and analogues thereof are disclosed, for 
example, in U.S. Patent Nos. 4,137,230; 4,248,870; 4,256,746; 4,260,608; 4,265,814; 4,294,757; 4,307,016; 

30 4,308,268; 4,308,269; 4,309,428; 4,313,946; 4,315,929; 4,317,821; 4,322,348; 4,331,598; 4,361,650; 
4,364,866; 4,424,219; 4,450,254; 4,362,663; and 4,371,533, the disclosures of which are hereby expressly 
incorporated by reference. 
Mavtansinoid-antibodv conjugates 

In an attempt to improve their therapeutic index, maytansine and maytansinoids have been conjugated 

35 to antibodies specifically binding to tumor cell antigens. Immunoconjugates containing maytansinoids and their 
therapeutic use are disclosed, for example, in U.S. Patent Nos. 5,208,020, 5,416,064 and European Patent EP 
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0 425 235 Bl , the disclosures of which are hereby expressly incorporated by reference. Liu et al.,Proc. Natl. 
Acad. Sci. USA 93:8618-8623 (1996) described immunoconjugates comprising a maytansinoid designated DM1 
tinted to the monoclonal antibody C242 directed against human colorectal cancer. The conjugate was found 
to be highly cytotoxic towards cultured colon cancer cells, and showed antitumor activity in an in vivo tumor 
growth assay. Charietal.,. Cancer Research 52:127-131 (1992) describe immunoconjugates in which a 
maytansinoid was conjugated via a disulfide linker to the murine antibody A7 binding to an antigen on human 
colon cancer cell lines, or to another murine monoclonal antibody TA.l that binds the HER-2/neu oncogene. 
The cytotoxicity of the TA. 1-maytansonoid conjugate was tested in vitro on the human breast cancer cell line 
SK-BR-3, which expresses 3 x 10 5 HER-2 surface antigens per cell. The drug conjugate achieved a degree of 
cytotoxicity similar to the free maytansonid drug, which could be increased by increasing the number of 
maytansinoid molecules per antibody molecule. The A7-maytansinoid conjugate showed low systemic 
cytotoxicity in mice. 

Anti-TAT polypeptide antibodv-mavtansmoid conjugates fimmunoconiugates) 

Anti-TAT antibody-maytansinoid conjugates are prepared by chemically linking an anti-TAT antibody 
to a maytansinoid molecule without significantly diminishing the biological activity of either the antibody or the 
maytansinoid molecule. An average of 3-4 maytansinoid molecules conjugated per antibody molecule has 
shown efficacy in enhancing cytotoxicity of target cells without negatively affecting the function or solubility 
of the antibody, although even one molecule of toxin/antibody would be expected to enhance cytotoxicity over 
the use of naked antibody. Maytansinoids are well known in the art and can be synthesized by known techniques 
or isolated from natural sources. Suitable maytansinoids are disclosed, for example, in U.S. Patent No. 
5,208,020 and in the other patents and nonpatent publications referred to hereinabove. Preferred maytansinoids 
are maytansinol and maytansinol analogues modified in the aromatic ring or at other positions of the maytansinol 
molecule, such as various maytansinol esters. 

There are many linking groups known in the art for making antibody-maytansinoid conjugates, 
including, for example, those disclosed in U.S. Patent No. 5,208,020 or EP Patent 0 425 235 Bl, and Chari 
et ai„ Cancer Research 52:127-131 (1992). The linking groups include disufide groups, thioether groups, acid 
labile groups, photolabile groups, peptidase labile groups, or esterase labile groups, as disclosed in the above- , 
identified patents, disulfide and thioether groups being preferred. 

Conjugates of the antibody and maytansinoid may be made using a variety of bifunctional protein 
coupling agents such as N-su(ximinidyW-(2-pyridyldithio) propionate (SPDP), succinimidyl-4-(N- 
maleimidomethyi) cyclohexane-l-carboxylate, iminothiolane (IT), bifunctional derivatives of imidoesters (such 
as dimethyl adipimidate HCL), active esters (such as disuccinimidyl suberate), aldehydes (such as 
gjutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) hexanediamine), bis-diazonium derivatives 
(such as bis-(p-diazoniumbenzoyl)^ diisocyanates (such as toluene 2,6-diisocyanate), and bis- 

active fluorine compounds (such as l,5-difluoro-2,4-dinitrobenzene). Particularly preferred coupling agents 
include N-succininudyl-3K2^ [1978]) 
and N-succinimidyi-4-(2-pyridyltMo)pentanoate (SPP) to provide for a disulfide linkage. 
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The linker may be attached to the maytansinoid molecule at various positions, depending on the type 
of the link. For example, an ester linkage may be formed by reaction with a hydroxyl group using conventional 
coupling techniques. The reaction may occur at the C-3 position having a hydroxyl group, the C-14 position 
modified with hyrdoxymethyl, the C-15 position modified with a hydroxyl group, and the C-20 position having 
a hydroxyl group. In a preferred embodiment, the linkage is formed at the C-3 position of maytansinol or a 
maytansinol analogue. 
Calicheamicin 

Another immunoconjugate of interest comprises an anti-TAT antibody conjugated to one or more 
calicheamicin molecules. The calicheamicin family of antibiotics are capable of producing double-stranded 
DNA breaks at sub-picomolar concentrations. For the preparation of conjugates of the calicheamicin family, 
see U.S. patents 5,712,374, 5,714,586, 5,739,116, 5,767,285, 5,770,701, 5,770,710, 5,773,001, 5,877,296 
(all to American Cyanamid Company). Structural analogues of calicheamicin which may be used include, but 
are not limited to, y t \ aj, of, N-acetyl^ 1 , PSAG and d\ (Hinman et al., Cancer Research 53:3336-3342 
(1993), Lode et al., Cancer Research 58:2925-2928 (1998) and the aforementioned U.S. patents to American 
Cyanamid). Another anti-tumor drug that the antibody can be conjugated is QFA which is an antifolate. Both 
calicheamicin and QFA have intracellular sites of action and do not readily cross the plasma membrane. 
Therefore, cellular uptake of these agents through antibody mediated internalization greatly enhances their 
cytotoxic effects. 
Other cytotoxic agents 

Other antitumor agents that can be conjugated to the anti-TAT antibodies of the invention include 
BCNU, streptozoicin, vincristine and 5-fluorouracil, the family of agents known collectively LL-E33288 
complex described in U.S. patents 5,053,394, 5,770,710, as well as esperamicins (U.S. patent 5,877,296). 

Enzymatically active toxins and fragments thereof which can be used include diphtheria A chain, 
nonbinding active fragments of diphtheria toxin, exotoxin A chain (from Pseudomonas aeruginosa), ricin A 
chain, abrin A chain, modeccin A chain, alpha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytolaca 
americana proteins (PAPI, PAPII, and PAP-S), momordica charantia inhibitor, curcin, crotin, sapaoharia 
officinalis inhibitor, gelonin, mitogellin, restrictocin, phenomycin, enomycin and the tricothecenes. See, for 
example, WO 93/21232 published October 28, 1993. 

The present invention further contemplates an immunoconjugate formed between an antibody and a 
compound with nucleolytic activity (e.g., a ribonuclease or a DNA endonuclease such as a deoxyribonuclease; 
DNase). 

For selective destruction of the tumor, the antibody may comprise a highly radioactive atom. A variety 
of radioactive isotopes are available for the production of radioconjugated anti-TAT antibodies. Examples 
include At 211 , 1 131 , 1 125 , Y 90 , Re 186 , Re 188 , Sm 153 , Bi 212 , P 32 , Pb 2n and radioactive isotopes of Lu. When the 
conjugate is used for diagnosis, it may comprise a radioactive atom for scintigraphic studies, for example to 99 " 1 
or I , or a spin label for nuclear magnetic resonance (NMR) imaging (also known as magnetic resonance 
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imaging, mri), such as iodine- 123 again, iodine-131, indium-Ill, fluorine-19, carbon-13, nitrogeri-15, oxygen- 
17, gadolinium, manganese or iron. 

The radio- or other labels may be incorporated in the conjugate in known ways. For example, the 
peptide may be biosynthesized or may be synthesized by chemical amino acid synthesis using suitable amino 

_99m 123 186 

acid precursors involving, for example, fluorine-19 in place of hydrogen. Labels such aster or I , .Re , 
5 Re 188 and In 111 can be attached via a cysteine residue in the peptide. Yttrium-90 can be attached via a lysine 
residue. The IODOGEN method (Fraker et al (1978) Biochem. Biophys. Res. Commun. 80: 49-57 can be used 
to incorporate iodine-123. "Monoclonal Antibodies inImmunoscintigraphy w (Chatal,CRC Press 1989) describes 
other methods in detail. 

Conjugates of die antibody and cytotoxic agent may be made using a variety of bifunctional protein 

10 coupling agents such as N-suc^inimidyl-3-(2-pyridylditibio) propionate (SPDP), succinimidyl-4-(N- 

maleimidomethyl) cyclohexane-l-carboxylate, iminothiolane (IT), bifunctional derivatives of imidoesters (such 
as dimethyl adipimidate HCL), active esters (such as disuccinimidyl suberate), aldehydes (such as 
glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) hexanediamine), bis-diazonium derivatives 
(such as bis-(p-diazoniumbenz»yl)-ethylenediamine), diisocyanates (such as tolyene 2,6-diisocyanate), and bis- 

15 active fluorine compounds (such as l,5-difhioro-2,4-dinitrobenzene). For example, a ricin immunotoxin can 
be prepared as described in Vitetta et al. , Science 238: 1098 (1987). Carbon-14-labeled 1-isothiocyanatobenzyl- 
3-methyldiethylene triaminepentaacetic acid (MX-DTPA) is an exemplary chelating agent for conjugation of 
radionucleotide to the antibody. See W094/ 11026. The linker may be a "cleavable linker" facilitating release 
of 1he cytotoxic drug in the cell. For example, an acid-labile linker, peptidase-sensitive linker, photolabile 

20 linker, dimethyl linker or disulfide-containing linker (Chari et al., Cancer Research 52:127-131 (1992); U.S. 
Patent No. 5,208,020) may be used. 

Alternatively, a fusion protein comprising the anti-TAT antibody and cytotoxic agent may be made, 
e.g., by recombinant techniques or peptide synthesis. The length of DNA may comprise respective regions 
encoding the two portions of the conjugate either adjacent one another or separated by a region encoding a linker 

25 peptide which does not destroy the desired properties of the conjugate. 

In yet another embodiment, the antibody may be conjugated to a "receptor" (such streptavidin) for 
utilization in tumor pre-targeting wherein the antibody-receptor conjugate is administered to the patient, followed 
by removal of unbound conjugate from the circulation using a clearing agent and then administration of a 
"ligand" (e.g., avidin) which is conjugated to a cytotoxic agent (e.g., a radionucleotide). 

30 10. Immunoliposomes 

The anti-TAT antibodies disclosedherein may also be formulated as immunoliposomes. A "liposome" 
is a small vesicle composed of various types of lipids, phospholipids and/or surfactant which is useful for 
delivery of a drug to a mammal. The components of die liposome are commonly arranged in a bilayer 
formation, similar to the lipid arrangement of biological membranes. Liposomes containing the antibody are 

35 prepared by methods known in the art, such as described in Epstein etal., Proc. Natl. Acad. Sci. USA 82:3688 
(1985); Hwang et aL. Proc. Nad Acad. Sci. USA 77:4030 (1980); U.S. Pat. Nos. 4,485,045 and 4,544,545; 
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and W097/38731 published October 23, 1997. Liposomes with enhanced circulation time are disclosed in U.S. 
Patent No. 5,013,556. 

Particularly useful liposomes can be generated by the reverse phase evaporation method with a lipid 
composition comprising phosphatidylcholine, cholesterol and PEG-derivatized phosphatidyletlmolamine (PEG- 
PE). Liposomes are extruded through filters of defined pore size to yield liposomes with the desired diameter. 
5 Fab* fragments of the antibody of the present invention can be conjugated to the liposomes as described in 

Martin et al., J. Biol. Chem. 257:286-288 (1982) via a disulfide interchange reaction. A chemotherapeutic 
agent is optionally contained within the liposome. See Gabizon et al., J. National Cancer Inst. 81(19):1484 
(1989). 

B. TAT Binding Oligopeptides 

1 0 TAT binding oligopeptides of the present invention are oligopeptides that bind, preferably specifically, 

to a TAT polypeptide as described herein. TAT binding oligopeptides may be chemically synthesized using 
known oligopeptide synthesis methodology or may be prepared and purified using recombinant technology. 
TAT binding oligopeptides are usually at least about 5 amino acids in length, alternatively at least about 6, 7, 
8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 

15 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 
65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 
93, 94, 95, 96, 97, 98, 99, or 100 amino acids in length or more, wherein such oligopeptides that are capable 
of binding, preferably specifically, to a TAT polypeptide as described herein. TAT binding oligopeptides may 
be identified without undue experimentation using well known techniques. In this regard, it is noted that 

20 techniques for screening oligopeptide libraries for oligopeptides that are capable of specifically binding to a 
polypeptide target are well known in the art (see, e.g., U.S. Patent Nos. 5,556,762, 5,750,373, 4,708,871, 
4,833,092, 5,223,409, 5,403,484, 5,571,689, 5,663,143; PCT Publication Nos. WO 84/03506 and 
WO84/03564; Geysen et al., Proc. Natl. Acad. Sci. U.S.A., 81:3998-4002 (1984); Geysen et al., Proc. Natl. 
Acad. Sci. U.S.A., 82:178-182 (1985);' Geysen et al., in Synthetic Peptides as Antigens, 130-149 (1986); 

25 Geysen et al., J. Immunol. Meth., 102:259-274 (1987); Schoofs et al., J. Immunol., 140:611-616 (1988), 
Cwirla, S. E. et al. (1990) Proc. Natl. Acad. Sci. USA, 87:6378; Lowman, H.B. et al. (1991) Biochemistry, 
30:10832; Clackson, T. et al. (1991) Nature, 352: 624; Marks. J. D. et al. (1991), J. Mol. Biol., 222:581; 
Rang, A.S. et al. (1991) Proc. Natl. Acad. Sci. USA, 88:8363, and Smith, G. P. (1991) Current Opin. 
Biotechnol., 2:668). 

30 In this regard, bacteriophage (phage) display is one well known technique which allows one to screen 

large oligopeptide libraries to identify member(s) of those libraries which are capable of specifically binding 
to a polypeptide target. Phage display is a technique by which variant polypeptides are displayed as fusion 
proteins to the coat protein on the surface of bacteriophage particles (Scott, J.K. and Smith, G. P. (1990) 
Science 249: 386). The utility of phage display lies in the fact that large libraries of selectively randomized 

3 5 protein variants (or randomly cloned cDNAs) can be rapidly and efficiently sorted for those sequences that bind 
to a target molecule with high affinity. Display of peptide (Cwirla, S. E. et al. (1990) Proc. Natl. Acad. Sci. 
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USA, 87:6378) or protein (Lowman, H.B. et aL (1991) Biochemistry, 30:10832; Clackson, T. et al. (1991) 
Nature, 352: 624; Marks, J. D. et al. (1991), J. Mol. Biol., 222:581; Kang, A.S. et al. (1991) Proc. Natl. 
Acad, Sci. USA, 88:8363) libraries on phage have been used for screening millions of polypeptides or 
oligopeptides for ones with specific binding properties (Smith, G. P. (1991) Current Opin. Biotechnol., 2:668). 
Sorting phage libraries of random mutants requires a strategy for constructing and propagating a large number 
5 of variants, a procedure for affinity purification using the target receptor, and a means of evaluating the results 
of binding enrichments. U.S. Patent Nos. 5,223,409, 5,403,484, 5,571,689, and 5,663,143. 

Although most phage display methods have used filamentous phage, lambdoid phage display systems 
(WO 95/34683; U.S. 5,627,024), T4 phage display systems (Ren, Z-J. et al. (1998) Gene 215:439; Zhu, Z. 
(1997) CAN 33:534; Jiang, J. et al. (1997) can 128:44380; Ren, Z-J. et al. (1997) CAN 127:215644; Ren, Z-J. 

10 (1996) Protein Sci. 5:1833; Efimov, V. P. et al. (1995) Virus Genes 10:173) and T7 phage display systems 
(Smith, G. P. and Scott, J.K. (1993) Methods mEnzymology, 217, 228-257; U.S. 5,766,905) are also known. 

Many other improvements and variations of the basic phage display concept have now been developed. 
These improvements enhance the ability of display systems to screen peptide libraries for binding to selected 
target molecules and to display functional proteins with the potential of screening these proteins for desired 

15 properties. Combinatorial reaction devices for phage display reactions have been developed (WO 98/14277) 
and phage display libraries have been used to analyze and control bimolecular interactions (WO 98/20169; WO 
98/20159) and properties of constrained helical peptides (WO 98/20036). WO 97/35196 describes a method 
of isolating an affinity ligand in which a phage display library is contacted with one solution in which the ligand 
will bind to a target molecule and a second solution in which the affinity ligand will not bind to the target 

20 molecule, to selectively isolate binding ligands. WO 97/46251 describes a method of biopanning a random 
phage display library with an affinity purified antibody and then isolating binding phage, followed by a 
micropanning process using microplate wells to isolate high affinity binding phage. The use ofStqphfylococcus 
aureus protein A as an affinity tag has also been reported (Li et al. (1998) Mol Biotech. , 9: 187). WO 97/47314 
describes the use of substrate subtraction libraries to distinguish enzyme specificities using a combinatorial 

25 library which may be a phage display library. A method for selecting enzymes suitable for use in detergents 
using phage display is described in WO 97/09446. Additional methods of selecting specific binding proteins 
are described in U.S. Patent Nos. 5,498,538, 5,432,018, and WO 98/15833. 

Methods of generating peptide libraries and screening these libraries are also disclosed in U.S. Patent 
Nos. 5,723,286, 5,432,018, 5,580,717, 5,427,908, 5,498,530, 5,770,434, 5,734,018, 5,698,426, 5,763,192, 

30 and 5/723,323. 

C. TAT Binding Organic Molecules 

TAT binding organic molecules are organic molecules other than oligopeptides or antibodies as defined 
herein mat bind, preferably specifically, to a TAT polypeptide as described herein. TAT binding organic 
molecules may be identified and chemically synthesized using known methodology (see, e.g. , PCT Publication 
35 Nos. WOOO/00823 and WO00/39585). TAT binding organic molecules are usually less man about 2000 daltons 
in size, alternatively less than about 1500, 750, 500, 250 or 200 daltons in size, wherein such organic molecules 
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that are capable of binding, preferably specifically, to a TAT polypeptide as described herein may be identified 
without undue experimentation using well known techniques. In this regard, it is noted that techniques for 
screening organic molecule libraries for molecules that are capable of binding to a polypeptide target are well 
known in the art (see, e.g., PCT Publication Nos. WO00/00823 and WO00/39585). TAT binding organic 
molecules may be, for example, aldehydes, ketones, oximes, hydrazones, semicarbazones, carbazides, primary 
5 amines, secondary amines, tertiary amines, N-substituted hydrazines, hydrazides, alcohols, ethers, thiols, 
thioethers, disulfides, carboxylic acids, esters, amides, ureas, carbamates, carbonates, ketals, thioketals, acetals, 
thioacetals, aryi halides, aryl sulfonates, alkyl halides, alkyl sulfonates, aromatic compounds, heterocyclic 
compounds, anilines, alkenes, alkynes, diols, amino alcohols, oxazolidines, oxazolines, thiazolidines, 
thiazolines, enamines, sulfonamides, epoxides, aziridines, isocyanates, sulfonyl chlorides, dtazo compounds, 

10 acid chlorides, or the like. 

D. Screening for Anti-TAT Antibodies. TAT Binding Oligopeptides and TAT Binding Organic 

Molecules With the Desired Properties 
Techniques for generating antibodies, oligopeptides and organic molecules that bind to TAT 
polypeptides have been described above. One may further select antibodies, oligopeptides or other organic 

1 5 molecules with certain biological characteristics, as desired. 

The growth inhibitory effects of an anti-TAT antibody, oligopeptide or other organic molecule of the 
invention may be assessed by methods known in the art, e.g., using cells which express a TAT polypeptide 
either endogenously or following transfection with the TAT gene. For example, appropriate tumor cell lines 
and TAT-transfected cells may treated with an anti-TAT monoclonal antibody, oligopeptide or other organic 

20 molecule of the invention at various concentrations for a few days (e.g. , 2-7) days and stained with crystal violet 
or MTT or analyzed by some other colorimetric assay. Another method of measuring proliferation would be 
by comparing 3 H-thymidine uptake by the cells treated in the presence or absence an anti-TAT antibody, TAT 
binding oligopeptide or TAT binding organic molecule of the invention. After treatment, the cells are harvested 
and the amount of radioactivity incorporated into the DNA quantitated in a scintillation counter. Appropriate 

25 positive controls include treatment of a selected cell line with a growth inhibitory antibody known to inhibit 
growth of that cell line. Growth inhibition of tumor cells in vivo can be determined in various ways known in 
the art. Preferably, the tumor cell is one that overexpresses a TAT polypeptide. Preferably, the anti-TAT 
antibody, TAT binding oligopeptide or TAT binding organic molecule will inhibit cell proliferation of a TAT- 
expressing tumor cell in vitro or in vivo by about 25-100% compared to the untreated tumor cell, more 

30 preferably, by about 30-100%, and even more preferably by about 50-100% or 70-100%, in one embodiment, 
at an antibody concentration of about 0.5 to 30 ^g/rnl. Growth inhibition can be measured at an antibody 
concentration of about 0.5 to 30 ng/ml or about 0.5 nM to 200 nM in cell culture, where the growth inhibition 
is determined 1-10 days after exposure of the tumor cells to the antibody. The antibody is growth inhibitory 
in vivo if administration of the anti-TAT antibody at about 1 ng/kg to about 100 mg/kg body weight results in 

35 reduction in tumor size or reduction of tumor cell proliferation within about 5 days to 3 months from the first 
administration of the antibody, preferably within about 5 to 30 days. 
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To select for an anti-TAT antibody, TAT binding oligopeptide or TAT binding organic molecule which 
induces cell death, loss of membrane integrity as indicated by, e.g. , propidium iodide (PI), trypan blue or 7AAD 
uptake may be assessed relative to qontrol. A PI uptake assay can be performed in the absence of complement 
and immune effector cells. TAT polypeptide-expressing tumor cells are incubated with medium alone or 
medium containing the appropriate anti-TAT antibody (e.g, at about 10ng/ml), TAT binding oligopeptide or 
5 TAT binding organic molecule. The cells are incubated for a 3 day time period. Following each treatment, 
cells are washed and aliquoted into 35 mm strainer-capped 12 x 75 tubes (1ml per tube, 3 tubes per treatment 
group) for removal of cell clumps. Tubes then receive PI (10 iig/mi). Samples may be analyzed using a 
FACSCAN® flow cytometer and FACSCONVERT® CellQuest software (Becton Dickinson). Those anti-TAT 
antibodies, TAT binding oligopeptides or TAT binding organic molecules that induce statistically significant 
10 levels of cell death as determined by PI uptake may be selected as cell death-inducing anti-TAT antibodies, TAT 
binding oligopeptides or TAT binding organic molecules. 

To screen for antibodies, oligopeptides or other organic molecules which bind to an epitope on a TAT 
polypeptide bound by an antibody of interest, a routine cross-blocking assay such as that described in 
Antibodies. A Laboratory Manual. Cold Spring Harbor Laboratory, Ed Harlow and David Lane (1988), can 
15 be performed. This assay can be used to determine if a test antibody, oligopeptide or other organic molecule 
binds fee same site or epitope as a known anti-TAT antibody. Alternatively, or additionally, epitope mapping 
can be performed by methods known in the art . For example, die antibody sequence can be mutagenized such 
as by alanine scanning, to identify contact residues. The mutant antibody is initailly tested for binding with 
polyclonal antibody to ensure proper folding. In a different method, peptides corresponding to different regions 
20 of a TAT polypeptide can be used in competition assays with the test antibodies or with a test antibody and an 
antibody with a characterized or known epitope. 

E. Antibody Dependent Enzvme Mediated Prodrug Therapy (ADEPT) 
The antibodies of the present invention may also be used in ADEPT by conjugating the antibody to a 
prodrug-activating enzyme which converts a prodrug (e.g., a peptidyl chemotherapeutic agent, see 
25 WO81/01 145) to an active anti-cancer drug. See, for example, WO 88/07378 and U.S. Patent No. 4,975,278. 

The enzyme component of the immunoconjugate useful for ADEPT includes any enzyme capable of 
acting on a prodrug in such a way so as to covert it into its more active, cytotoxic form. 

Enzymes that are useful in the method of this invention include, but are not limited to, alkaline 
phosphatase useful for converting phosphate-containing prodrugs into free drugs; aryisulfatase useful for 
30 converting sulfate-containing prodrugs into free drugs; cytosine deaminase useful for converting non-toxic 5- 
fluorocytosine into the anti-cancer drug, 5-fluorouracil; proteases, such as serratia protease, thermolysin, 
subtilisin, carboxypeptidases and cathepsins (such as cathepsins B and L), that are useful for converting peptide- 
containing prodrugs into free drugs; D-alanylcarboxypeptidases, useful for converting prodrugs that contain D- 
amino acid substituents; carbohydrate-cleaving enzymes such as p-galactosidase and neuraminidase useful for 
35 converting glycosylated prodrugs into free drugs; P-lactamase useful for converting drugs derivatized with p- 
lactams into free drugs; and penicillin amidases, such as penicillin V amidase or penicillin G amidase, useful 
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for converting drugs derivatized at their amine nitrogens with phenoxyacetyl or phenylacetyl groups, 
respectively, into free drugs. Alternatively, antibodies with enzymatic activity, also known in the art as 
"abzymes", can be used to convert the prodrugs of the invention into free active drugs (see, e.g., Massey, 
Nature 328:457-458 (1987)). Antibody-abzyme conjugates can be prepared as described herein for delivery of 
the abzyme to a tumor cell population. 
5 The enzymes of this invention can be covalently bound to the anti-TAT antibodies by techniques well 

known in the art such as the use of the heterobifunctional crosslinking reagents discussed above. Alternatively, 
fusion proteins comprising at least the antigen binding region of an antibody of the invention linked to at least 
a functionally active portion of an enzyme of the invention can be constructed using recombinant DNA 
techniques well known in the art (see, e.g., Neuberger et ai., Nature 312:604-608 (1984). 
10 F. Full-Length TAT Polypeptides 

The present invention also provides newly identified and isolated nucleotide sequences encoding 
polypeptides referred to in the present application as TAT polypeptides . In particular, cDNAs (partial and full- 
length) encoding various TAT polypeptides have been identified and isolated, as disclosed in further detail in 
the Examples below. 

15 As disclosed in the Examples below, various cDNA clones have been deposited with the ATCC . Hie 

actual nucleotide sequences of those clones can readily be determined by the skilled artisan by sequencing of 
the deposited clone using routine methods in the art. The predicted amino acid sequence can be determined 
from the nucleotide sequence using routine skill. For the TAT polypeptides and encoding nucleic acids 
described herein, in some cases, Applicants have identified what is believed to be the reading frame best 

20 identifiable with the sequence information available at the time. 

G. Anti-TAT Antibody and TAT Polypeptide Variants 

In addition to the anti-TAT antibodies and full-length native sequence TAT polypeptides described 
herein, it is contemplated that anti-TAT antibody and TAT polypeptide variants can be prepared. Anti-TAT 
antibody and TAT polypeptide variants can be prepared by introducing appropriate nucleotide changes into the 
25 encoding DNA, and/or by synthesis of the desired antibody or polypeptide. Those skilled in the art will 

appreciate that amino acid changes may alter post-translational processes of the anti-TAT antibody or TAT 
polypeptide, such as changing die number or position of glycosylation sites or altering the membrane anchoring 
characteristics. 

Variations in the anti-TAT antibodies and TAT polypeptides described herein, can be made, for 
30 example, using any of the techniques and guidelines for conservative and non-conservative mutations set forth, 
for instance, in U.S. Patent No. 5,364,934. Variations may be a substitution, deletion or insertion of one or 
more codons encoding the antibody or polypeptide that results in a change in the amino acid sequence as 
compared with the native sequence antibody or polypeptide. Optionally the variation is by substitution of at least 
one amino acid with any other amino acid in one or more of the domains of the anti-TAT antibody or TAT 
35 polypeptide. Guidance in determining which amino acid residue may be inserted, substituted or deleted without 
adversely affecting the desired activity may be found by comparing the sequence of the anti-TAT antibody or 
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TAT polypeptide with that of homologous known protein molecules and minimizing the number of amino acid 
sequence changes made in regions of high homology. Amino acid substitutions can be the result of replacing 
one amino acid with another amino acid having similar structural and/or chemical properties, sucn as the 
replacement of a leucine with a serine, i.e. , conservative amino acid replacements. Insertions or deletions may 
optionally be in the range of about 1 to 5 amino acids. The variation allowed may be determined by 
systematically making insertions, deletions or substitutions of amino acids in the sequence and testing die 
resulting variants for activity exhibited by the full-length or mature native sequence. 

Anti-TAT antibody and TAT polypeptide fragments are provided herein. Such fragments may be 
truncated at the N-tenninus or C-terminus, or may lack internal residues, for example, when compared with 
a full length native antibody or protein. Certain fragments lack amino acid residues that are not essential for 
a desired biological activity of the anti-TAT antibody or TAT polypeptide. 

Anti-TAT antibody and TAT polypeptide fragments may be prepared by any of a number of 
conventional techniques. Desired peptide fragments may be chemically synthesized. An alternative approach 
involves generating antibody or polypeptide fragments by enzymatic digestion, e.g., by treating die protein with, 
an enzyme known to cleave proteins at sites defined by particular amino acid residues, or by digesting the DNA 
with suitable restriction enzymes and isolating the desired fragment. Yet another suitable technique involves 
isolating and amplifying a DNA fragment encoding a desired antibody or polypeptide fragment, by polymerase 
chain reaction (PCR). Oligonucleotides that define the desired termini of the DNA fragment are employed at 
the 5' and 3' primers in the PCR. Preferably, anti-TAT antibody and TAT polypeptide fragments share at least 
one biological and/or immunological activity with the native anti-TAT antibody or TAT polypeptide disclosed 
herein. 

In particular embodiments , conservative substitutions of interest are shown in Table 6 under the heading 
of preferred substitutions. If such substitutions result in a change in biological activity, then more substantial 
changes, denominated exemplary substitutions in Table 6, or as further described below in reference to amino 
acid classes, are introduced and the products screened. 
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Table 6 



Original 


Exemplary 


Pref 


Residue 


Substitutions 


Substitutions 


Ala (A) 


val; leu; ile 


val 


Arg(R) 


lys; gin; asn 


lys 


Asn (N) 


gin; his; lys; arg 


gin 


Asp(D) 


glu 


glu 


Cys (C) 


ser 


ser 


Gln(Q) 


asn 


asn 


Ghi(E) 


asp 


asp 


Gly (G) 


pro; ala 


ala 


His (H) 


asn; gin; lys; arg 


arg 


lie (I) 


leu; val; met; ala; phe; 






norleucine 


leu 


Leu (L) 


norleucine; ile; val; 






met; ala; phe 


ile 


Lys (K) . 


arg; gin; asn 


arg 


Met(M) 


leu; phe; ile 


leu 


Phe (F) 


leu; val; ile; ala; tyr 


leu 


Pro(P) 


ala 


ala 


Ser (S) 


thr 


thr 


Thr (T) 


ser 


ser 


Trp(W) 


tyr; phe 


tyr 


Tyr(Y) 


trp; phe; thr; ser 


phe 


Val(V) 


ile; leu; met; phe; 






ala; norleucine 


leu 



Substantial modifications in function or immunological identity of the anti-TAT antibody or TAT 
polypeptide are accomplished by selecting substitutions that differ significantly in their effect on maintaining 
(a) the structure of die polypeptide backbone in the area of the substitution, for example, as a sheet or helical 
conformation, (b) the charge or hydropbobicity of the molecule at the target site, or (c) the bulk of the side 
chain. Naturally occurring residues are divided into groups based on common side-chain properties: 

(1) hydrophobic: norleucine, met, ala, val, leu, ile; 

(2) neutral hydrophilic: cys, ser, to; 

(3) acidic: asp, glu; 

(4) basic: asn, gin, his, lys, arg; 

(5) residues that influence chain orientation: gly, pro; and 

(6) aromatic: trp, tyr, phe. 

Non-conservative substitutions will entail exchanging a member of one of these classes for another 
class. Such substituted residues also may be introduced into the conservative substitution sites or, more 
preferably, into the remaining (non-conserved) sites. 
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The variations can be made using methods known in the art such as oligonucleotide-mediated (site- 
directed) mutagenesis, alanine scanning, and PCR mutagenesis. Site-directed mutagenesis [Carter et al. Nucl. 
Acids Res. . 13:4331 (1986); Zoller et al., Nucl. Acids Res.. 10:6487 (1987)], cassette mutagenesis [Wells et 
al., Gene, 34:315 (1985)], restriction selection mutagenesis [Wells etaL, Philos. Trans. R. Soc. London SerA. 
317:415 (1986)] or other known techniques can be performed on the cloned DNA to produce the anti-TAT 
antibody or TAT polypeptide variant DNA. 

Scanning amino acid analysis can also be employed to identify one or more amino acids along a 
contiguous sequence. Among the preferred scanning amino acids are relatively small, neutral amino acids. 
Such amino acids include alanine, glycine, serine, and cysteine. Alanine is typically a preferred scanning amino 
acid among this group because it eliminates the side-chain beyond the beta-carbon and is less likely to alter the 
main-chain conformation of the variant [Cunningham and Wells, Science. 244:1081-1085 (1989)]. Alanine is 
also typically preferred because it is the most common amino acid. Further, it is frequently found in both buried 
and exposed positions [Creighton, The Proteins. (W.H. Freeman & Co., N. Y.); Chothia, J. MoL Biol.. 150:1 
(1976)] . If alanine substitution does not yield adequate amounts of variant, an isoteric amino acid can be used. 

Any cysteine residue not involved in maintaining the proper conformation of the anti-TAT antibody 
or TAT polypeptide also may be substituted, generally with serine, to improve the oxidative stability of the 
molecule and prevent aberrant crosslinking. Conversely, cysteine bond(s) may be added to the anti-TAT 
antibody or TAT polypeptide to improve its stability (particularly where the antibody is an antibody fragment 
such as an Fv fragment). 

A particularly preferred type of substitutional variant involves substituting one or more hypervariable 
region residues of a parent antibody (e.g., ahumanized or human antibody). Generally, the resulting variants) 
selected for further development will have improved biological properties relative to the parent antibody from 
winch they are generated. A convenient way for generating such substitutional variants involves affinity 
maturation using phage display. Briefly, several hypervariable region sites (e.g., 6-7 sites) are mutated to 
generate all possible amino substitutions at each site. The antibody variants thus generated are displayed in a 
monovalent fashion from filamentous phage particles as fusions to the gene HI product of M13 packaged within 
each particle. The phage-displayed variants are then screened for their biological activity (e.g., binding affinity) 
as herein disclosed. In order to identity candidate hypervariable region sites for modification, alanine scanning 
mutagenesis can be performed to identify hypervariable region residues contributing significantly to antigen 
binding. Alternatively, or additionally, it may be beneficial to analyze a crystal structure of the antigen-antibody 
complex to identify contact points between the antibody and human TAT polypeptide. Such contact residues 
and neighboring residues are candidates for substitution according to the techniques elaborated herein. Once 
such variants are generated, the panel of variants is subjected to screening as described herein and antibodies 
with superior properties in one or more relevant assays may be selected for further development. 

Nucleic acid molecules encoding amino acid sequence variants of the anti-TAT antibody are prepared 
by a variety of methods known in the art. These methods include, but are not limited to, isolation from a natural 
source (in the case of naturally occurring amino acid sequence variants) or preparation by oligonucleotide- 
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mediated (or site-directed) mutagenesis, PCR mutagenesis, and cassette mutagenesis of an earlier prepared 
variant or a non-variant version of the anti-TAT antibody. 

H. Modifications of Anti-TAT Antibodies and TAT Polypeptides 

Covalent modifications of anti-TAT antibodies and TAT polypeptides are included within the scope of 
this invention. One type of covalent modification includes reacting targeted amino acid residues of an anti-TAT 
5 antibody or TAT polypeptide with an organic derivatizing agent that is capable of reacting with selected side 
chains or the N- or C- terminal residues of the anti-TAT antibody or TAT polypeptide. Derivatization with 
bifunctional agents is useful, for instance, for crosslinking anti-TAT antibody or TAT polypeptide to a water- 
insoluble support matrix or surface for use in the method for purifying anti-TAT antibodies, and vice-versa. 
Commonly used crosslinking agents include, e.g., l,l-bis(diazoacetyl)-2-pbenylethane, glutaraldehyde, N- 
10 hydroxysuccinimide esters, for example, esters with 4-azidosalicylic acid, homobifunctional imidoesters, 
including disuccinimidyl esters such as 3,3 , -dithiobis(sui:ciimiiidylpropionate), bifunctional maleimides such as 
bis-N-maleimido- 1 ,8-octane and agents such as mediyl-3-[(p-azidophenyl)ditiuo]propioimidate. 

Other modifications include deamidation of glutaminyl and asparaginyl residues to the corresponding 
glutamyl and aspartyl residues, respectively, hydroxylation of proline and lysine, phosphorylation of hydroxyl 
15 groups of seryl or threonyl residues, methylation of the cc-amino groups of lysine, arginine, and histidine side 
chains [T.E. Creighton, Proteins: Structure and Molecular Properties, W.H. Freeman & Co., San Francisco, 
pp. 79-86 (1983)], acetylation of the N-tenninal amine, and amidation of any C-terminal carboxyl group. 

Another type of covalent modification of the anti-TAT antibody or TAT polypeptide included within 
the scope of this invention comprises altering the native glycosylation pattern of the antibody or polypeptide. 
20 "Altering the native glycosylation pattern" is intended for purposes herein to mean deleting one or more 

carbohydrate moieties found in native sequence anti-TAT antibody or TAT polypeptide (either by removing the 
underlying glycosylation site or by deleting the glycosylation by chemical and/or enzymatic means), and/or 
adding one or more glycosylation sites that are not present in the native sequence anti-TAT antibody or TAT 
polypeptide. In addition, the phrase includes qualitative changes in the glycosylation of the native proteins, 
25 involving a change in die nature and proportions of the various carbohydrate moieties present. 

Glycosylation of antibodies and other polypeptides is typically either N-linked or O-linked. N-linked 
refers to the attachment of the carbohydrate moiety to the side chain of an asparagine residue. The tripeptide 
sequences asparagine-X-serine and asparagine-X-tfareonine, where X is any amino acid except proline, are the 
recognition sequences for enzymatic attachment of the carbohydrate moiety to the asparagine side chain. Thus, 
30 die presence of either of these tripeptide sequences in a polypeptide creates a potential glycosylation site. O- 
linked glycosylation refers to the attachment of one of the sugars N-aceylgalactosamine, galactose, or xylose 
to ahydroxyamino acid, most commonly serine or threonine, although 5-hydroxyproline or 5-hydroxylysine may 
also be used. 
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Addition of glycosylation sites to the anti-TAT antibody or TAT polypeptide is conveniently 
accomplished by altering the ammo acid sequence such that it contains one or more of the above-described 
tripeptide sequences (for N-linked glycosylation sites). The alteration may also be made by the addition of, or 
substitution by, one or more serine or threonine residues to the sequence of the original anti-TAT antibody or 
TAT polypeptide (for Olinked glycosylation sites). The anti-TAT antibody or TAT polypeptide amino acid 
5 sequence may optionally be altered through changes at the DNA level, particularly by mutating the DNA 

encoding the anti-TAT antibody or TAT polypeptide at preselected bases such that codons are generated that 
will translate into die desired amino acids. 

Another means of increasing the number of carbohydrate moieties on the anti-TAT antibody or TAT 
10 polypeptide is by chemical or enzymatic coupling of glycosides to the polypeptide. Such methods are described 

in the art, e.g., in WO 87/05330 published 11 September 1987, and in Aplin and Wriston, CRC Crit. Rev. 

Biochem., pp. 259-306 (1981). 

Removal of carbohydrate moieties present on the anti-TAT antibody or TAT polypeptide may be 

accomplished chemically or enzymatically or by mutational substitution of codons encoding for amino acid 
15 residues that serve as targets for glycosylation. Chemical deglycosylation techniques are known in the art and 

described, for instance, by Hakimuddin, et al., Arch. Biochem. Biophys., 259:52 (1987) and by Edge et al., 

Anal. Biochem.. 118:131 (1981). Enzymatic cleavage of carbohydrate moieties on polypeptides canbe achieved 

by the use of a variety of endo- and exo-glycosidases as described by Thotakura et al., Meth. Enzvmol.. 

138:350 (1987). 

20 Another type of covalent modification of anti-TAT antibody or TAT polypeptide comprises linking the 

antibody or polypeptide to one of a variety of nonproteinaceous polymers, e.g., polyethylene glycol (PEG), 
polypropylene glycol, or polyoxyalkylenes, in the manner set forth in U.S. Patent Nos. 4,640,835; 4,496,689; 
4,301,144; 4,670,417; 4,791,192 or 4,179,337. The antibody or polypeptide also may be entrapped in 
microcapsules prepared, for example , by coacervation techniques or by interracial polymerization (for example , 

25 hydroxymethylcellulose or gelatin-microcapsules and poly-(metirylmethacylate) microcapsules, respectively), 
in colloidal drug delivery systems (for example, liposomes, albumin microspheres, microemulsions, nano- 
particles and nanocapsules), or in macroemulsions. Such techniques are disclosed in Remington's 
Pharmaceutical Sciences, 16th edition, Oslo, A., Ed., (1980). 

The anti-TAT antibody or TAT polypeptide of the present invention may also be modified in a way to 

30 form chimeric molecules comprising an anti-TAT antibody or TAT polypeptide fused to another, heterologous 
polypeptide or amino acid sequence. 

In one embodiment, such a chimeric molecule comprises a fusion of the anti-TAT antibody or TAT 
polypeptide with a tag polypeptide which provides an epitope to which an anti-tag antibody can selectively bind. 
The epitope tag is generally placed at the amino- or carboxyl- terminus of the anti-TAT antibody or TAT 

35 polypeptide. The presence of such epitope-tagged forms of the anti-TAT antibody or TAT polypeptide can be 
detected using an antibody against the tag polypeptide. Also, provision of the epitope tag enables the anti-TAT 
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antibody or TAT polypeptide to be readily purified by affinity purification using an anti-tag antibody or another 
type of affinity matrix that binds to the epitope tag. Various tag polypeptides and their respective antibodies 
are well known in the art. Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) 
tags; the flu HA tag polypeptide and its antibody 12CA5 [Field et al., Mol. Cell. Biol.. 8:2159-2165 (1988)]; 
the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 9E10 antibodies thereto [Evan et al. , Molecular and Cellular 
5 Biology, £:3610-3616 (1985)]; and die Herpes Simplex virus glycoprotein D (gD) tag and its antibody [Paborsky 
etal., Protein Engineering, 3(6):547-553(1990)]. Other tag polypeptides include the Flag-peptide[Hoppetal M 
BioTechnology, 6:1204-1210 (1988)]; the KT3 epitope peptide [Martin et al., Science, 255:192-194 (1992)]; 
an a-tubulin epitope peptide [Skinner et al., J. Biol. Chem., 266:15163-15166 (1991)]; and the T7 gene 10 
protein peptide tag [Lutz-Freyennuth et al., Proc. Natl. Acad. Sci. USA, 87:6393-6397 (1990)]. 

10 

In an alternative embodiment, the chimeric molecule may comprise a fusion of the anti-T AT antibody 
or TAT polypeptide with an immunoglobulin or a particular region of an immunoglobulin. For a bivalent form 
of the chimeric molecule (also referred to as an "immunoadhesin"), such a fusion could be to the Fc region of 
an IgG molecule. The Ig fusions preferably include the substitution of a soluble (transmembrane domain deleted 

15 or inactivated) form of an anti-TAT antibody or TAT polypeptide in place of at least one variable region within 
an Ig molecule. In a particularly preferred embodiment, the immunoglobulin fusion includes the hinge, CH 2 
andCH 3 , orthe hinge, CH„ CH 2 and CH 3 regions of an IgGl molecule. For the production of immunoglobulin 
fusions see also US Patent No. 5,428,130 issued June 27, 1995. 

I. Preparation of Anti-TAT Antibodies and TAT Polypeptides 

20 The description below relates primarily to production of anti-TAT antibodies and TAT polypeptides 

by culturing cells transformed or transfected with a vector containing anti-TAT antibody- and TAT polypeptide- 
encoding nucleic acid. It is, of course, contemplated that alternative methods, which are well known in die art, 
may be employed to prepare anti-TAT antibodies and TAT polypeptides. For instance, the appropriate amino 
acid sequence, or portions thereof, may be produced by direct peptide synthesis using solid-phase techniques 

25 [see, e.g., Stewart et al., Solid-Phase Peptide Synthesis, W.H. Freeman Co., San Francisco, CA (1969); 

Merrifield, J. Am. Chem. Soc.. 85:2149-2154 (1963)]. In vitro protein synthesis may be performed using 
manual techniques or by automation. Automated synthesis may be accomplished, for instance, using an Applied 
Biosystems Peptide Synthesizer (Foster City, CA) using manufacturer's instructions. Various portions of the 
anti-TAT antibody or TAT polypeptide may be chemically synthesized separately and combined using chemical 

30 or enzymatic methods to produce the desired anti-TAT antibody or TAT polypeptide. 

1. Isolation of PNA Encoding Anti-TAT Antibody or TAT Polypeptide 
DNA encoding anti-TAT antibody or TAT polypeptide may be obtained from a cDNA library prepared 
from tissue believed to possess the anti-TAT antibody or TAT polypeptide mRNA and to express it at a 
detectable level. Accordingly, human anti-TAT antibody or TAT polypeptide DNA can be conveniendy 

35 obtained from a cDNA library prepared from human tissue. The anti-TAT antibody- or, TAT polypeptide- 
encoding gene may also be obtained from a genomic library or by known synthetic procedures (e.g. , automated 

291 

1/24/2006, EAST Version: 2.0.1.4 



WO 2004/030615 



PCT/US2003/028547 



nucleic acid synthesis). 

libraries can be screened with probes (such as oligonucleotides of at least about 20-80 bases) designed 
to identify the gene of interest or the protein encoded by it. Screening the cDNA or genomic library with the 
selected probe may be conducted using standard procedures, such as described in Sambrook et al., Molecular 
Cloning: A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 1989). An alternative means 
to isolate the gene encoding anti-TAT antibody or TAT polypeptide is to use PCR methodology [Sambrook et 
al., supra: Dieffenbach et al., PCR Primer: A Laboratory Manual (Cold Spring Harbor Laboratory Press, 
1995)]. 

Techniques for screening a cDNA library are well known in the art. The oligonucleotide sequences 
selected as probes should be of sufficient length and sufficiently unambiguous that false positives are minimized. 
The oligonucleotide is preferably labeled such that it can be detected upon hybridization to DNA in the library 
being screened. Methods of labeling are well known in the art, and include the use of radiolabels like 32 P- 
labeled ATP, biotinylation or enzyme labeling. Hybridization conditions, including moderate stringency and 
high stringency, are provided in Sambrook et al., supra. 

Sequences identified in such library screening methods can be compared and aligned to other known 
sequences deposited and available in public databases such as GenBank or other private sequence databases. 
Sequence identity (at either the amino acid or nucleotide level) within defined regions of the molecule or across 
the full-length sequence can be determined using methods known in the art and as described herein. 

Nucleic acid having protein coding sequence may be obtained by screening selected cDN A or genomic 
libraries using the deduced amino acid sequence disclosed herein for the first time, and, if necessary, using 
conventional primer extension procedures as described in Sambrook et al., supra, to detect precursors and 
processing intermediates of mRNA that may not have been reverse-transcribed into cDNA. 
2. Selection and Transformation of Host Cells 
Host cells are transfected or transformed with expression or cloning vectors described herein for anti- 
TAT antibody or TAT polypeptide production and cultured in conventional nutrient media modified as 
appropriate for inducing promoters, selecting transformants, or amplifying the genes encoding the desired 
sequences. The culture conditions, such as media, temperature, pH and the like, can be selected by die skilled 
artisan without undue experimentation. In general, principles, protocols, and practical techniques for 
maximizing the productivity of cell cultures can be found in Mammalian Cell Biotechnology: a Practical 
Approach. M. Butler, ed. <JKL Press, 1991) and Sambrook et al., supra. 

Methods of eukaryotic cell transfection and prokaryotic cell transformation are known to the ordinarily 
skilled artisan, for example, CaCl 2 , CaP0 4 , liposome-mediated and electroporation. Depending on the host ceil 
used, transformation is performed using standard techniques appropriate to such cells. The calcium treatment 
employing calcium chloride, as described in Sambrook et al., supra, or electroporation is generally used for 
prokaryotes. Infection with Agrobacteriwn tumefaciens is used for transformation of certain plant cells, as 
described by Shaw et al., Gene. 2^:315 (1983) and WO 89/05859 published 29 June 1989, For m am mal ia n 
cells without such cell walls, the calcium phosphate precipitation method of Graham and van der Eb, Virology, 
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52:456-457 (1978) can be employed. General aspects of mammalian cell host system transitions have been 
described in U.S. Patent No. 4,399,216. Transformations into yeast are typically carried out according to the 
method of Van Solingen et al., J. Bact., 130:946 (1977) and Hsiao et al., Proc. Natl. Acad. Sci. (USA). 
76:3829 (1979). However, other methods for introducbg DNA into cells, such as by nuclear microinjection, 
electroporation, bacterial protoplast fusion with intact cells, orpolycations, e.g., polybrene, polyornithine, may 
5 also be used. For various techniques for transforming mammalian cells, see Keown et al., Methods in 
Enzvmology. 185:527-537 (1990) and Mansour et al., Nature, 336:348-352 (1988). 

Suitable host cells for cloning or expressing the DNA in the vectors herein include prokaryote, yeast, 
or higher eukaryote cells. Suitable prokaryotes include but are not limited to eubacteria, such as Gram-negative 
or Gram«^)ositive organisms, for example, Enterobacteriaceae such as E. coli. Various E. coli strains are 
10 publicly available, such as E. coli K12 strain MM294 (ATCC 31,446); E. coli X1776 (ATCC 31 ,537); E. coli 
strain W3110 (ATCC 27,325) and K5 772 (ATCC 53,635). Other suitable prokaryotic host cells include 
Enterobacteriaceae such as Escherichia, e.g., E. coli, Enterobacter, Erwinia, Klebsiella, Proteus, Salmonella, 
e.g., Salmonella typhirnuriwn, Serratia, e.g., Serratia marcescans, and Shigella, as well as Bacilli such as B. 
subtilis and B. licheniformis (e.g., B. Ucheniformis 41P disclosed in DD 266,710 published 12 April 1989), 
15 Pseudomonas such as P. aeruginosa, and Streptomyces. These examples are illustrative rather than limiting. 
Strain W3110 is one particularly preferred host or parent host because it is a common host strain for 
recombinant DNA product fermentations. Preferably, the host cell secretes minimal amounts of proteolytic 
enzymes. For example, strain W3110 may be modified to effect a genetic mutation in the genes encoding 
proteins endogenous to the host, with examples of such hosts including E. coli W3110 strain 1A2, which has 
20 the complete genotype tonA ; E. coli W3110 strain 9E4, which has the complete genotype tonA ptr3; E. coli 
W3 1 10 strain 27C7 (ATCC 55,244), which has the complete genotype tonA ptr3 phoA E15 (argF-lac)169 degP 
otnpT kan r ; E. coli W31 10 strain 37D6, which has the complete genotype tonA ptr3 phoA E15 (argF-lac)169 
degP ompT rbs7 ilvG kari; E. coli W31 10 strain 40B4, which is strain 37D6 with a non-kanamycin resistant 
degP deletion mutation; and an E. coli strain having mutant periplasmic protease disclosed in U.S. Patent No. 
25 4,946,783 issued 7 August 1990. Alternatively, in vitro methods of cloning, e.g. , PCR or other nucleic acid 
polymerase reactions, are suitable. 

Full length antibody, antibody fragments, and antibody fusion proteins can be produced in bacteria, 
in particular when glycosylation and Fc effector function are not needed, such as when the therapeutic antibody 
is conjugated to a cytotoxic agent (e.g. , a toxin) and the immunoconjugate by itself shows effectiveness in tumor 
30 cell destruction. Full length antibodies have greater half life in circulation. Production in E. coli is faster and 
more cost efficient. For expression of antibody fragments and polypeptides in bacteria, see, e.g., U.S. 
5,648,237 (Carter et. al.), U.S. 5,789,199 (Joly et al.), and U.S. 5,840,523 (Simmons et al.) which describes 
translation initiation regio (TIR) and signal sequences for optimizing expression and secretion, these patents 
incorporated herein by reference. After expression, the antibody is isolated from the E. coli cell paste in a 
35 soluble fraction and can be purified through, e.g., a protein A or G column depending on the isotype. Final 
purification can be carried out similar to the process for purifying antibody expressed e.g,, in CHO cells. 
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In addition to prokaryotes, eukaryotic microbes such as filamentous fungi or yeast are suitable cloning 
or expression hosts for anti-TAT antibody- or TAT polypeptide-encoding vectors. Saccharomyces cerevisiae 
is a commonly used lower eukaryotic host microorganism. Others inchxdeSchizosaccharomyces pornbe (Beach 
and Nurse, Nature, 290: 140 [1981]; EP 139,383 published 2 May 1985); Kluyveromyces hosts (U.S. Patent 
No. 4,943,529; Fleer et aL, Bio/Technology. 9:968-975 (1991)) such as, e.g., K. lactis (MW98-8C, CBS683, 
5 CBS4574; LouvencourtetaL, J. BacterioL. 154(2):737-742 [1983]), K.fragilis (ATCC 12,424), K. bulgaricus 
(ATCC 16,045), K. wickeramii (ATCC 24,178), K. waltii (ATCC 56,500), K. drosophilarum (ATCC 36,906; 
Van den Berg et aL, Bio/Technology. 8:135 (1990)), & thermotolerans, and K. marxiamis; yarrowia (EP 
402,226); Pickiapastoris (EP 183,070; Sreekrishna et aL, J. Basic Microbiol., 28:265-278 [1988]); Candida; 
Trichoderma reesia (EP 244,234); Neurospora crassa (Case et aL , Proc. Natl. Acad. Sci. USA. 76:5259-5263 

10 [1979]); Schwanmomyces such as Schwarmiomyces occidentalis (EP 394,538 published 31 October 1990); and 
filamentous fungi such as, e.g., Neurospora, Pemcillium, Totypocladium (WO 91/00357 published 10 January 
1991), zod Aspergillus hosts such as A. nidulans (Ballance et aL , Biochem. Biophvs. Res. Commun.. 1 12:284- 
289 [1983]; Tilburn et aL , Gene. 26:205-221 [1983]; Yeltonet aL, Proc. Natl. Acad. Sci. USA, 81: 1470-1474 
[1984]) andA niger (Kelly and Hynes, BMBOJ.. 4:475-479 [1985]). Methylotropic yeasts are suitable herein 

15 and include, but are not limited to, yeast capable of growth on methanol selected from the genera consisting of 
Hansemda, Candida, Kloeckera, Pichia, Saccharomyces, Tondopsis,mdRhodotorula. A list of specific species 
that are exemplary of this class of yeasts may be found in C. Anthony, The Biochemistry of Methvlotrop hs, 269 
(1982). 

Suitable host cells for the expression of glycosylated anti-TAT antibody or TAT polypeptide are derived 

20 from multicellular organisms. Examples of invertebrate cells include insect cells such as Drosophila S2 and 
Spodoptera Sf9, as well as plant cells, such as cell cultures of cotton, corn, potato, soybean, petunia, tomato, 
and tobacco. Numerous baculo viral strains and variants and corresponding permissive insect host cells from 
hosts such as Spodoptera frugiperda (caterpillar), Aedes aegypti (mosquito), Aedes albopictus (mosquito), 
Drosophila melanogaster (fruitfly), and Bombyx mori have been identified. A variety of viral strains for 

25 transfection are publicly available, e.g. , the L-l variant of Autographa caUfornica NPV and the Bm-5 strain of 
Bombyx mori NPV, and such viruses may be used as the vims herein according to the present invention, 
particularly for transfection of Spodoptera frugiperda cells. 

However, interest has been greatest in vertebrate cells, and propagation of vertebrate cells in culture 
(tissue culture) has become a routine procedure. Examples of useful mammalian host cell lines are monkey 

30 kidney CV1 line transformed by SV40 (COS-7, ATCC CRL 1651); human embryonic kidney line (293 or 293 
cells subcloned for growth in suspension culture, Graham et al., J. Gen Virol. 36:59 (1977)); baby hamster 
kidney cells (BHK, ATCC CCL 10); Chinese hamster ovary cells/-DHFR (CHO, Urlaub et al., Proc. Natl. 
Acad. Sci. USA 77:4216 (1980)); mouse Sertoli cells (TM4, Mather, Biol. Reprod. 23:243-251 (1980)); monkey 
kidney cells (CV1 ATCC CCL 70); African green monkey kidney cells (VERO-76, ATCC CRL- 1587); human 

35 cervical carcinoma cells (HELA, ATCC CCL 2); canine kidney cells (MDCK, ATCC CCL 34); buffalo rat 
liver cells (BRL 3A, ATCC CRL 1442); human hing cells (W138, ATCC CCL 75); human liver cells (Hep G2, 
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HB 8065); mouse mammary tumor (MMT 060562, ATCC CCL51); TRI cells (Mather et al., Annals N.Y, 
Acad. Sci. 383:44-68 (1982)); MRC 5 cells; FS4 cells; and a human hepatoma line (Hep G2). 

Host cells are transformed with the above-described expression or cloning vectors for anti-TAT 
antibody or TAT polypeptide production and cultured in conventional nutrient media modified as appropriate 
for inducing promoters, selecting transformants, or amplifying the genes encoding the desired sequences. 
3. Selection and Use of a Replicable Vector 
The nucleic acid (e.g. , cDNA or genomic DNA) encoding anti-TAT antibody or TAT polypeptide may 
be inserted into a replicable vector for cloning (amplification of the DNA) or for expression. Various vectors 
are publicly available. The vector may, for example, be in the form of a plasmid, cosmid, viral particle, or 
phage. The appropriate nucleic acid sequence may be imertedintothe vector by a variety of procedures. In 
general, DNA is inserted into an appropriate restriction endonuclease site(s) using techniques known in the art. 
Vector components generally include, but are not limited to, one or more of a signal sequence, an origin of 
replication, one or more marker genes, an enhancer element, a promoter, and a transcription termination 
sequence. Construction of suitable vectors containing one or more of these components employs standard 
ligation techniques which are known to the skilled artisan. 

The TAT may be produced recombinantly not only directly, but also as a fusion polypeptide with a 
heterologous polypeptide, which may be a signal sequence or other polypeptide having a specific cleavage site 
at the N-terminus of the mature protein or polypeptide. In general, the signal sequence may be a component 
of die vector, or it may be a part of the anti-TAT antibody- or TAT polypeptide-encoding DNA that is inserted 
into the vector. The signal sequence may be a prokaryotic signal sequence selected, for example, from the 
group of the alkaline phosphatase, penicillinase, lpp, or heat-stable enterotoxin D leaders. For yeast secretion 
die signal sequence may be, e.g., the yeast invertase leader, alpha factor leader (including Saccharoinyces and 
Kluyveromyces a-factor leaders, the latter described in U.S. Patent No. 5,010, 182), or acid phosphatase leader, 
the C. albicans glucoamylase leader (EP 362,179 published 4 April 1990), or the signal described in WO 
90/13646 published 15 November 1990. In mammalian cell expression, mammalian signal sequences may be 
used to direct secretion of the protein, such as signal sequences from secreted polypeptides of the same or 
related species, as well as viral secretory leaders. 

Both expression and cloning vectors contain a nucleic acid sequence that enables the vector to replicate 
in one or more selected host cells. Such sequences are well known for a variety of bacteria, yeast, and viruses. 
The origin of replication from the plasmid pBR322 is suitable for most Gram-negative bacteria, the 2\i plasmid 
origin is suitable for yeast, and various viral origins (SV40, polyoma, adenovirus, VSV or BPV) are useful for 
cloning vectors in mammalian cells. 

Expression and cloning vectors will typically contain a selection gene, also termed a selectable marker. 
Typical selection genes encode proteins that (a) confer resistance to antibiotics or other toxins, e.g. , ampicillin, 
neomycin, methotrexate, or tetracycline, (b) complement auxotrophic deficiencies, or (c) supply critical 
nutrients not available from complex media, e.g., the gene encoding D-alanine racemase for Bacilli 

An example of suitable selectable markers for mammalian cells are those that enable the identification 
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of cells competent to take up the anti-TAT antibody- or TAT polypeptide-encoding nucleic acid, such as DHFR 
or thymidine kinase. An appropriate host ceil when wild-type DHFR is employed is the CHO cell line deficient 
in DHFR activity, prepared and propagated as described by Urlaub et al. , Proc. Natl. Acad. Sci. USA, 77:4216 
(1980). A suitable selection gene for use in yeast is the trpl gene present in the yeast plasmid YRp7 
[Stinchcomb et al., Nature. 282:39 (1979); Kingsman et al., Gene, 7:141 (1979); Tschemper et al., Gene, 
5 10: 157 (1980)] . The trpl gene provides a selection marker for a mutant strain of yeast lacking the ability to 
grow in tryptophan, for example, ATCC No. 44076 or PEP4-1 [Jones, Genetics. 85:12 (1977)]. 

Expression and cloning vectors usually contain a promoter operably linked to the anti-TAT antibody- 
or TAT polypeptide-encoding nucleic acid sequence to direct mRNA synthesis. Promoters recognized by a 
variety of potential host cells are well known. Promoters suitable for use with prokaryotic hosts include thep- 

10 lactamase and lactose promoter systems [Chang etal., Nature. 275:615 (1978); Goeddel et al., Nature, 281:544 
(1979)], alkaline phosphatase, a tryptophan (trp) promoter system [GoeddeL Nucleic Acids Res. , 8:4057 (1980); 
EP 36,776], and hybrid promoters such as the tac promoter [deBoer et al. , Proc. Natl. Acad. Sci. USA. 80:21- 
25 (1983)]. Promoters for use in bacterial systems also will contain a Shine-Dalgarno (S.D.) sequence operably 
linked to the DNA encoding anti-TAT antibody or TAT polypeptide. 

15 Examples of suitable promoting sequences for use with yeast hosts include the promoters for 3- 

phosphoglycerate kinase [Hitzeman et al. , J.Biol. Chem.. 255:2073 (1980)] or other glycolytic enzymes [Hess 
et al., J. Adv. Enzyme Reg.. 7:149 (1968); Holland, Biochemistry, 17:4900 (1978)], such as enolase, 
glyceraldehyde-3-phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose- 
6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose 

20 isomerase, and glucokinase. 

Other yeast promoters, which are inducible promoters having the additional advantage of transcription 
controlled by growth conditions, are the promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid 
phosphatase, degradative enzymes associated with nitrogen metabolism, metallothionein, glyceraldehyde-3- 
phosphate dehydrogenase, and enzymes responsible for maltose and galactose utilization. Suitable Vectors and 

25 promoters for use in yeast expression are further described in EP 73,657. 

Anti-TAT antibody or TAT polypeptide transcription from vectors in mammalian host cells is 
controlled, for example, by promoters obtained from the genomes of viruses such as polyoma virus, fowlpox 
virus (UK 2,211,504 published 5 July 1989), adenovirus (such as Adenovirus 2), bovine papilloma virus, avian 
sarcoma virus, cytomegalovirus, a retrovirus, hepatitis~B virus and Simian Virus 40 (SV40), from heterologous 

30 mammalian promoters, e.g., the actin promoter or an immunoglobulin promoter, and from heat-shock 
promoters, provided such promoters are compatible with the host cell systems. 

Transcription of a DNA encoding the anti-TAT antibody or TAT polypeptide by higher eukary otes may 
be increased by inserting an enhancer sequence into the vector. Enhancers are cis-acting elements of DNA, 
usually about from 10 to 300 bp, that act on a promoter to increase its transcription. Many enhancer sequences 

35 are now known from mammalian genes (giobin, elastase, albumin, a-fetoprotein, and insulin). Typically, 
however, one will use an enhancer from a eukaryotic cell virus. Examples include the SV40 enhancer on the 
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late side of the replication origin (bp 100-270), the cytomegalovirus early promoter enhancer, the polyoma 
enhancer on the late side of the replication origin, and adenovirus enhancers. The enhancer may be spliced into 
the vector at a position 5' or 3' to the anti-TAT antibody or TAT polypeptide coding sequence, but is preferably 
located at a site 5' from the promoter. 

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human, or 
5 nucleated cells from .other multicellular organisms) will also contain sequences necessary for the termination 
of transcription and for stabilizing the mRNA. Such sequences are commonly available from the 5' and, 
occasionally 3 ? , untranslated regions of eukaryotic or viral DNAs or cDNAs. These regions contain nucleotide 
segments transcribed as polyadenylated fragments in the untranslated portion of the mRNA encoding anti-TAT 
antibody or TAT polypeptide. 

1 0 Still other methods, vectors, and host cells suitable for adaptation to the synthesis of anti-TAT antibody 

or TAT polypeptide in recombinant vertebrate cell culture are described in Gething et al. , Nature. 293:620-625 
(1981); Mantei et al., Nature. 281:40-46 (1979); EP 117,060; and EP 117,058. 
4. Culturing the Host Cells 
The host cells used to produce the anti-TAT antibody or TAT polypeptide of this invention may be 

15 cultured in a variety of media. Commercially available media such as Ham's F10 (Sigma), Minimal Essential 
Medium ((MEM), (Sigma), RPMI-1640 (Sigma), and Dulbecco' s Modified Eagle's Medium ((DMEM), Sigma) 
are suitable for culturing the host cells. In addition, any of the media described in Ham et al Meth. Enz. 58:44 
(1979), Barnes et al., Anal. Biochem. 1 02:255 (1980), U.S. Pat. Nos. 4,767,704; 4,657,866; 4,927,762; 
4,560,655; or 5,122,469; WO 90/03430; WO 87/00195; or U.S. Patent Re. 30,985 may be used as culture 

20 media for the host cells. Any of these media may be supplemented as necessary with hormones and/or other 
growth factors (such as insulin, transferrin, or epidermal growth factor), salts (such as sodium chloride, 
calcium, magnesium, and phosphate), buffers (such as HEPES), nucleotides (such as adenosine and thymidine), 
antibiotics (such as GENTAMYCIN™ drug), trace elements (defined as inorganic compounds usually present 
at final concentrations in the micromolar range), and glucose or an equivalent energy source. Any other 

25 necessary supplements may also be included at appropriate concentrations that would be known to those skilled 
in the art. The culture conditions, such as temperature, pH, and the like, are those previously used with the host 
cell selected for expression, and will be apparent to the ordinarily skilled artisan. 
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5. Detecting Gene Amplification/Expression 

Gene amplification and/or expression may be measured in a sample directly, for example, by 
conventional Southern blotting, Northern blotting to quantitate the transcription of mRNA [Thomas, Proc. Natl. 
Acad. Sci. USA , 77:5201-5205 (1980)], dot blotting (DNA analysis), or in situ hybridization, using an 
appropriately labeled probe, based on the sequences provided herein. Alternatively, antibodies may be 

5 employed that can recognize specific duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA hybrid 
duplexes or DNA-protein duplexes. The antibodies in turn may be labeled and the assay may be carried out 
where the duplex is bound to a surface, so that upon the formation of duplex on the surface, the presence of 
antibody bound to the duplex can be detected. 

Gene expression, alternatively, may be measured by immunological methods, such as 

10 immunohistochemical staining of cells or tissue sections and assay of cell culture or body fluids, to quantitate 
directly the expression of gene product. Antibodies useful for immunohistochemical staining and/or assay of 
sample fluids may be either monoclonal or polyclonal, and may be prepared in any mammal. Conveniently, 
the antibodies may be prepared against a native sequence TAT polypeptide or against a synthetic peptide based 
on the DNA sequences provided herein or against exogenous sequence fused to TAT DNA and encoding a 

15 specific antibody epitope. 

6. Purification of Anti-TAT Anrihnrtv and TAT Polypeptide 

Forms of anti-TAT antibody and TAT polypeptide may be recovered from culture medium or from host 
cell lysates. If membrane-bound, it can be released from die membrane using a suitable detergent solution (e.g. 
Triton-X 100) or by enzymatic cleavage. Cells employed in expression of anti-TAT antibody and TAT 

20 polypeptide can be disrupted by various physical or chemical means, such as freeze-thaw cycling, sonication, 
mechanical disruption, or cell lysing agents. 

It may be desired to purify anti-TAT antibody and TAT polypeptide from recombinant cell proteins 
or polypeptides. The following procedures are exemplary of suitable purification procedures: by fractionation 
on an ion-exchange column; ethanol precipitation; reverse phase HPLC; chromatography on silica or on a 

25 cation-exchange resin such as DEAE; chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; gel 
filtration using, for example, Sephadex G-75; protein A Sepharose columns to remove contaminants such as 
IgG; and metal chelating columns to bind epitope-tagged forms of the anti-TAT antibody and TAT polypeptide. 
Various methods of protein purification may be employed and such methods are known in the art and described 
for example in Deutscher, Methods in Enzvmologv. 182 (1990); Scopes, Protein Purification: Principles and 

30 Practice, Springer-Verlag, New York (1982). The purification step(s) selected will depend, for example, on 
the nature of the production process used and the particular anti-TAT antibody or TAT polypeptide produced. 

When using recombinant techniques, the antibody can be produced intracellular^ , in the periplasmic 
space, or directly secreted into the medium. If the antibody is produced intracellularly, as a first step, the 
particulate debris, either host cells or lysed fragments, are removed, for example, by centrifugation or 

35 ultrafiltration. Carter et al. , Bio/Technology 10: 163-167 (1992) describe a procedure for isolating antibodies 
which are secreted to the periplasmic space of E. coll Briefly, cell paste is thawed in the presence of sodium 
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acetate (pH 3.5), EDTA, and phenylmethylsulfonylfluoride (PMSF) over about 30 min. Cell debris can be 
removed by centrifugation. Where the antibody is secreted into die medium, supernatants from such expression 
systems are generally first concentrated using a commercially available protein concentration filter, for example, 
an Amicon or Miilipore Pellicon ultrafiltration unit. A protease inhibitor such as PMSF may be included in any 
of the foregoing steps to inhibit proteolysis and antibiotics may be included to prevent the growth of adventitious 
5 contaminants. 

The antibody composition prepared from the cells can be purified using, for example, hydroxylapatite 
chromatography, gel electrophoresis, dialysis, and affinity chromatography, with affinity chromatography being 
the preferred purification technique. The suitability of protein A as an affinity ligand depends on the species 
and isotype of any immunoglobulin Fc domain that is present in the antibody. Protein A can be used to purify 

10 antibodies, that are based on human yl, y2 or y4 heavy chains (Lindmark et al., J. Immunol. Meth. 62:1-13 
(1983)). Protein G is recommended for all mouse isotypes and for human y3 (Guss et al., EMBO J. 
5:15671575 (1986)). The matrix to which the affinity ligand is attached is most often agarose, but other 
matrices are available. Mechanically stable matrices such as controlled pore glass or 
poly(styrenedivinyl)benzene allow for faster flow rates and shorter processing times than can be achieved with 

15 agarose. Where the antibody comprises a Q3 domain, the Bakerbond ABX™resin (J. T. Baker, Phillipsburg, 
N J) is useful for purification. Other techniques for protein purification such as fractionation on an ion-exchange 
column, ethanol precipitation, Reverse Phase HPLC, chromatography on silica, chromatography on heparin 
SEPHAROSE™ chromatography on an anion or cation exchange resin (such as a polyaspartic acid column), 
chromatof ocusing, SDS-PAGE, and ammonium sulfate precipitation are also available depending on the antibody 

20 to be recovered. 

Following any preliminary purification step(s), the mixture comprising the antibody of interest and 
contaminants may be subjected to low pH hydrophobic interaction chromatography using an elution buffer at 
a pH between about 2.5-4.5, preferably performed at low salt concentrations (e.g., from about 0-0.25M salt). 
J. Pharmaceutical Formulations 

25 Therapeutic formulations of the anti-TAT antibodies, TAT binding oligopeptides, TAT binding organic 

molecules and/or TAT polypeptides used in accordance with the present invention are prepared for storage by 
mixing the antibody, polypeptide, oligopeptide or organic molecule having the desired degree of purity with 
optional pharmaceutically acceptable carriers, excipients or stabilizers ( Remington*s Pharmaceutical Sciences 
16th edition, Osol, A. Ed. (1980)), in the form of lyophilized formulations or aqueous solutions. Acceptable 

30 carriers, excipients, or stabilizers are nontoxic to recipients at the dosages and concentrations employed, and 
include buffers such as acetate, Tris, phosphate, citrate, and other organic acids; antioxidants including ascorbic 
acid and methionine; preservatives (such as octadecyldimethylbenzyl ammonium chloride; hexamethonium 
chloride; benzalkjomum chloride, benzethonium chloride; phenol, butyl or benzyl alcohol; alkyl parabens such 
as methyl or propyl paraben; catechol; resorcinol; cyclohexanol; 3-pentanol; and m-cresol); low molecular 

35 weight (less than about 10 residues) polypeptides; proteins, such as serum albumin, gelatin, or iinmunoglobulins; 
hydrophilic polymers such as polyvinylpynolidone; amino acids such as glycine, glutamine, asparagine, 
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histtdine, arginine, or lysine; monosaccharides, disaccharides, and other carbohydrates including glucose, 
mannose, or dextrins; chelating agents such as EDTA; tonicifiers such as trehalose and sodium chloride; sugars 
such as sucrose, mannitol, trehalose or sorbitol; surfactant such as polysorbate; salt-forming counter-ions such 
as sodium; metal complexes (e.g., Zn-protein complexes); and/or non-ionic surfactants such as TWEEN®, 
PLURONICS® or polyethylene glycol (PEG). The antibody preferably comprises the antibody at a 
5 concentration of between 5-200 mg/ml, preferably between 10-100 mg/ml. 

The formulations herein may also contain more than one active compound as necessary for the 
particular indication being treated, preferably those with complementary activities that do not adversely affect 
each other. For example, in addition to an anti-TAT antibody, TAT binding oligopeptide, or TAT binding 
organic molecule, it may be desirable to include in the one formulation, an additional antibody, e.g. , a second 

10 anti-TAT antibody which binds a different epitope on the TAT polypeptide, or an antibody to some other target 
such as a growth factor that affects the growth of the particular cancer. Alternatively, or additionally, the 
composition may further comprise achemotherapeutic agent, cytotoxic agent, cytokine, growth inhibitory agent, 
anti-hormonal agent, and/or cardioprotectant. Such molecules are suitably present in combination in amounts 
that are effective for the purpose intended. 

1 5 The active ingredients may also be entrapped in microcapsules prepared, for example, by coacervation 

techniques or by interfacial polymerization, for example, hydroxymethylcelhilose or gelatin-microcapsules and 
poly-(methylmethacylate) microcapsules, respectively, in colloidal drug delivery systems (for example, 
liposomes, albumin microspheres, microemulsions, nano-particles and nanocapsules) or in macroemulsions. 
Such techniques are disclosed in Remington's Pharmaceutical Sciences. 16th edition, Osol, A. Ed. (1980). 

20 Sustained-release preparations may be prepared. Suitable examples of sustained-release preparations 

include semi-permeable matrices of solid hydrophobic polymers containing the antibody, which matrices are 
in the form of shaped articles, e.g., films, or microcapsules. Examples of sustained-release matrices include 
polyesters, hydrogels (for example, poly(2-hydroxyethyl-methacrylate), or poly(vinylalcohol)), polylactides 
(U.S. Pat. No. 3,773,919), copolymers of L-glutamic acid and y ethyl-L-glutamate, non-degradable ethylene- 

25 vinyl acetate, degradable lactic acid-glycolic acid copolymers such as the LUPRON DEPOT® (injectable 

microspheres composed of lactic acid-glycolic acid copolymer and leuprolide acetate), and poly-D-(-)-3- 
hydroxybutyric acid. 

The formulations to be used for in vivo administration must be sterile. This is readily accomplished 
by filtration through sterile filtration membranes. 
30 K. Diagnosis and Treatment with Anti-TAT Antibodies. TAT Binding Ol igopeptides and TAT 

Binding Organic Molecules 
To determine TAT expression in the cancer, various diagnostic assays are available. In one 
embodiment, TAT polypeptide overexpression may be analyzed by immunohistochemistry (IHC). Parrafin 
embedded tissue sections from a tumor biopsy may be subjected to the IHC assay and accorded a TAT protein 
35 staining intensity criteria as follows: 
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Score 0 - no staining is observed or membrane staining is observed in less than 10% of tumor cells. 

Score 1 4- - a feint/barely perceptible membrane staining is detected in more than 10% of the tumor 
cells. The cells are only stained in part of their membrane. 

Score 2+ - a weak to moderate complete membrane staining is observed in more than 10% of the 
tumor cells. 

Score 3+ - a moderate to strong complete membrane staining is observed in more than 10% of the 
tumor cells. 

Those tumors with 0 or 1+ scores for TAT polypeptide expression may be characterized as not 
overexpressing TAT, whereas those tumors with 2+ or 3+ scores may be characterized as overexpressing 
TAT. 

Alternatively, or additionaUy, FISH assays such as the INFORM® (sold by Ventana, Arizona) or 
PATHVISION® (Vysis, Illinois) may be carried out on formalin-fixed, paraffin-embedded tumor tissue to 
determine die extent (if any) of TAT overexpression in the tumor. 

TAT overexpression or amplification may be evaluated using an in vivo diagnostic assay, e.g., by 
administering a molecule (such as an antibody, oligopeptide or organic molecule) which binds the molecule to 
be detected and is tagged with a detectable label (e.g . , a radioactive isotope or a fluorescent label) and externally 
scanning the patient for localization of the label. 

As described above, the anti-TAT antibodies, oligopeptides and organic molecules of the inventionhave 
various non-therapeutic applications. The anti-TAT antibodies, oligopeptides and organic molecules of the 
present invention can be useful for diagnosis and staging of TAT polypeptide-expressing cancers (e.g., in 
radioimaging). The antibodies, oligopeptides and organic molecules are also useful for purification or 
immunoprecipitation of TAT polypeptide from cells, for detection and quantitation of TAT polypeptide in vitro, 
e.g. , in an ELISA or a Western blot, to kill and eliminate TAT-expressing cells from a population of mixed cells 
as a step in the purification of other cells. 

Currently, depending on die stage of the cancer, cancer treatment involves one or a combination of the 
following therapies: surgery to remove the cancerous tissue, radiation therapy, and chemotherapy. Anti-TAT 
antibody, oligopeptide or organic molecule therapy may be especially desirable in elderly patients who do not 
tolerate the toxicity and side effects of chemotherapy well and in metastatic disease where radiation therapy has 
limited usefulness. The tumor targeting anti-TAT antibodies, oligopeptides and organic molecules of the 
invention are useful to alleviate TAT-expressing cancers upon initial diagnosis of the disease or during relapse. 
For therapeutic applications, the anti-TAT antibody, oligopeptide or organic molecule can be used alone, or in 
combination therapy with, e.g., hormones, antiangiogens, or radiolabeled compounds, or with surgery, 
cryotherapy, and/or radiotherapy. Anti-TAT antibody, oligopeptide or organic molecule treatment can be 
administe red in conjunction with other forms of conventional therapy, either consecutively with, pre- or post- 
conventional therapy. Chemotherapeutic drugs such as TAXOTERE® (docetaxel), TAXOL® (palictaxel), 
estramustine and mitoxantrone are used in treating cancer, in particular, in good risk patients. In the present 
method of the invention for treating or alleviating cancer, the cancer patient can be administered anti-TAT 
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antibody, oligopeptide or organic molecule in conjuction with treatment with the one or more of the preceding 
chemotherapeutic agents. In particular, combination therapy withpalictaxel andmodified derivatives (see, e.g., 
EP0600517) is contemplated. The anti-TAT antibody, oligopeptide or organic molecule will be administered 
with a therapeutically effective dose of the chemotherapeutic agent. In another embodiment, the anti-TAT 
antibody, oligopeptide or organic molecule is administered in conjunction with chemotherapy to enhance the 
5 activity and efficacy of the chemotherapeutic agent, e.g. , paclitaxel. The Physicians* Desk Reference (PDR) 
discloses dosages of these agents that have been used in treatment of various cancers. The dosing regimen and 
dosages of these aforementioned chemotherapeutic drugs that are therapeutically effective will depend on die 
particular cancer being treated, the extent of the disease and other factors familiar to the physician of sldll in 
the art and can be determined by the physician. 

10 In one particular embodiment, a conjugate comprising an anti-TAT antibody, oligopeptide or organic 

molecule conjugated with a cytotoxic agent is administered to the patient. Preferably, the immunoconjugate 
bound to the TAT protein is internalized by the cell, resulting in increased therapeutic efficacy of the 
immunoconjugate in Willing the cancer cell to which it binds. In a preferred embodiment, the cytotoxic agent 
targets or interferes with die nucleic acid in the cancer cell. Examples of such cytotoxic agents are described 

15 above and include maytansinoids, calicheamicins, ribonucleases and DNA endonucleases. 

The anti-TAT antibodies, oligopeptides, organic molecules or toxin conjugates thereof are administered 
to a human patient, in accord with known methods, such as intravenous administration, e.g.,, as a bolus or by 
continuous infusion over a period of time, by intramuscular, intraperitoneal, intracerobrospinal, subcutaneous, 
intra-articular, intrasynovial, intrathecal, oral, topical, or inhalation routes. Intravenous or subcutaneous 

20 administration of the antibody, oligopeptide or organic molecule is preferred. 

Other therapeutic regimens may be combined with the administration of the anti-TAT antibody, 
oligopeptide or organic molecule. The combined administration includes co-adniinistration, using separate 
formulations or a single pharmaceutical formulation, and consecutive administration in either order, wherein 
preferably there is a time period while both (or all) active agents simultaneously exert their biological activities. 

25 Preferably such combined therapy results in a synergistic therapeutic effect. 

It may also be desirable to combine administration of the anti-TAT antibody or antibodies, oligopeptides 
or organic molecules, with administration of an antibody directed against another tumor antigen associated with 
the particular cancer. 

In another embodiment, the therapeutic treatment methods of the present invention involves die 
30 combined administration of an anti-TAT antibody (or antibodies), oligopeptides or organic molecules and one 
or more chemotherapeutic agents or growth inhibitory agents, including co-administration of cocktails of 
different chemotherapeutic agents. Chemotherapeutic agents include estramustine phosphate, prednimustine, 
cisplatin, 5-Quorouracil, melphalan, cyclophosphamide, hydroxyurea andhydroxyureataxanes (such as paclitaxel 
and doxetaxel) and/or anthracycline antibiotics. Preparation and dosing schedules for such chemotherapeutic 
35 agents may be used according to manufacturers' instructions or as determined empirically by the skilled 
practitioner. Preparation and dosing schedules for such chemotherapy are also described in Chemotherapy 
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Service Ed., M.C. Perry, Williams & Wilkins, Baltimore, MD (1992). 

The antibody, oligopeptide or organic molecule may be combined with an anti-hormonal compound; 
e.g., an anti-estrogen compound such as tamoxifen; an anti-progesterone such as onapristone (see, EP 616 812); 
or an anti-androgen such as flutamide, in dosages known for such molecules. Where the cancer to be treated 
is androgen independent cancer, the patient may previously have been subjected to anti-androgen therapy and, 
after the cancer becomes androgen independent, the anti-TAT antibody, oligopeptide or organic molecule (and 
optionally other agents as described herein) may be administered to the patient. 

Sometimes, it may be beneficial to also co-administer a cardioprotectant (to prevent or reduce 
myocardial dysfunction associated with the therapy) or one or more cytokines to the patient. In addition to the 
above therapeutic regimes, the patient may be subjected to surgical removal of cancer cells and/or radiation 
therapy, before, simultaneously with, or post antibody,' oligopeptide or organic molecule therapy. Suitable 
dosages for any of the above co-administered agents are those presently used and may be lowered due to the 
combined action (synergy) of the agent and anti-TAT antibody, oligopeptide or organic molecule. 

For the prevention or treatment of disease, the dosage and mode of administration will be chosen by 
the physician according to known criteria. The appropriate dosage of antibody, oligopeptide or organic molecule 
will depend on the type of disease to be treated, as defined above, the severity and course of the disease, 
whether the antibody, oligopeptide or organic molecule is administered for preventive or therapeutic purposes, 
previous therapy, the patient's clinical history and response to the antibody, oligopeptide or organic molecule, 
and the discretion of the attending physician. The antibody, oligopeptide or organic molecule is suitably 
administered to the patient at one time or over a series of treatments. Preferably, the antibody, oligopeptide 
or organic molecule is administered by intravenous infusion or by subcutaneous injections. Depending on the 
type and severity of die disease, about 1 u.g/kg to about 50 mg/kg body weight (e.g., about 0.1-15mg/kg/dose) 
of antibody can be an initial candidate dosage for adimnistration to the patient, whether, for example, by one 
or more separate administrations, or by continuous infusion. A dosing regimen can comprise administering an 
initial loading dose of about 4 mg/kg, followed by a weekly maintenance dose of about 2 mg/kg of die anti-TAT 
antibody. However, other dosage regimens may be useful. A typical daily dosage might range from about 1 
lig/kg to 100 mg/kg or more, depending on the factors mentioned above. For repeated administrations over 
several days or longer, depending on the condition, the treatment is sustained until a desired suppression of 
disease symptoms occurs. The progress of this therapy can be readily monitored by conventional methods and 
assays and based on criteria known to the physician or other persons of skill in die art. 

Aside from administration of die antibody protein to die patient, the present application contemplates 
ad^ninistration of the antibody by gene therapy. Such administration of nucleic acid encoding the antibody is 
encompassed by the expression "administering a therapeutically effective amount of an antibody". See, for 
example, WO96/07321 published March 14, 1996 concerning the use of gene therapy to generate intracellular 
antibodies. 

There are two major approaches to getting the nucleic acid (optionally contained in a vector) into die 
patient's cells; in vivo and ex vivo. For in vivo delivery the nucleic acid is injected directly into the patient, 
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usually at the site where the antibody is required. For ex vivo treatment, the patient's cells are removed, the 
nucleic acid is introduced into these isolated cells and the modified cells are administered to the patient either 
direcdy or, for example, encapsulated within porous membranes which are implanted into the patient (see, e.g., 
U.S. Patent Nos. 4,892,538 and 5,283, 187). There are a variety of techniques available for introducing nucleic 
acids into viable cells. The techniques vary depending upon whether the nucleic acid is transferred into cultured 
5 cells in vitro, or in vivo in die cells of the intended host. Techniques suitable for the transfer of nucleic acid 
into m a mmal ian cells in vitro include the use of liposomes, electroporation, microinjection, cell fusion, DEAE- 
dextran, die calcium phosphate precipitation method, etc. A commonly used vector forex vivo delivery of the 
gene is a retroviral vector. 

The currently preferred in vivo nucleic acid transfer techniques include transf ection with viral vectors 
10 (such as adenovirus, Herpes simplex I vims, or adeno-associated virus) and lipid-based systems (useful lipids 
for lipid-mediated transfer of the gene are DOTMA, DOPE and DC-Choi, for example). For review of the 
currently known gene marking and gene therapy protocols see Anderson et aL, Science 256:808-813 (1992). 
See also WO 93/25673 and the references cited therein. 

The anti-TAT antibodies of the invention can be in the different forms encompassed by the definition 
15 of "antibody" herein. Thus, the antibodies include full length or intact antibody, antibody fragments, native 
sequence antibody or amino acid variants, humanized, chimeric or fusion antibodies, immunoconjugates, and 
functional fragments thereof. In fusion antibodies an antibody sequence is fused to a heterologous polypeptide 
sequence. The antibodies can be modified in the Fc region to provide desired effector functions. As discussed 
in more detail in the sections herein, with the appropriate Fc regions, the naked antibody bound on the cell 
20 surface can induce cytotoxicity, e.g., via antibody-dependent cellular cytotoxicity (ADCC) or by recruiting 
complement in complement dependent cytotoxicity, or some other mechanism. Alternatively, where it is 
desirable to eliminate or reduce effector function, so as to minimize side effects or therapeutic complications, 
certain other Fc regions may be used. 

In one embodiment, the antibody competes for binding or bind substantially to, the same epitope as the 
25 antibodies of the invention. Antibodies having the biological characteristics of the present anti-TAT antibodies 
of the invention are also contemplated, specifically including the in vivo tumor targeting and any cell 
proliferation inhibition or cytotoxic characteristics. 

Methods of producing the above antibodies are described in detail herein. 

The present anti-TAT antibodies, oligopeptides and organic molecules are useful for treating a TAT- 
30 expressing cancer or alleviating one or more symptoms of the cancer in a mammal. Such a cancer includes 
prostate cancer, cancer of the urinary tract, lung cancer, breast cancer, colon cancer and ovarian cancer, more 
specifically, prostate adenocarcinoma, renal cell carcinomas, colorectal adenocarcinomas, lung 
adenocarcinomas, lung squamous cell carcinomas, and pleural mesothelioma. The cancers encompass metastatic 
cancers of any of the preceding. The antibody, oligopeptide or organic molecule is able to bind to at least a 
35 portion of the cancer cells that express TAT polypeptide in the mammal. In a preferred embodiment, the 
antibody, oligopeptide or organic molecule is effective to destroy or kill TAT-expressing tumor cells or inhibit 
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the growth of such tumor cells, in vitro or in vivo, upon binding to TAT polypeptide on the cell. Such an 
antibody includes a naked anti-TAT antibody (not conjugated to any agent). Naked antibodies that have 
cytotoxic or cell growth inhibition properties can be further harnessed with a cytotoxic agent to render them 
even more potent in tumor cell destruction. Cytotoxic properties can be conferred to an anti-TAT antibody by , 
e.g., conjugating the antibody with a cytotoxic agent, to form an immunoconjugate as described herein. The 
cytotoxic agent or a growth inhibitory agent is preferably a small molecule. Toxins such as calicheamicin or 
a maytansinoid and analogs or derivatives thereof, are preferable. 

The invention provides a composition comprising an anti-TAT antibody, oligopeptide or organic 
molecule of the invention, and a carrier. For the purposes of treating cancer, compositions can be administered 
to the patient in need of such treatment, wherein the composition can comprise one or more anti-TAT antibodies 
present as an immunoconjugate or as the naked antibody. In a further embodiment, the compositions can 
comprise these antibodies, oligopeptides or organic molecules in combination with other therapeutic agents such 
as cytotoxic or growth inhibitory agents, including chemotherapeutic agents. The invention also provides 
formulations comprising an anti-TAT antibody, oligopeptide or organic molecule of the invention, and a carrier. 
In one embodiment, the formulation is a therapeutic formulation comprising a pharmaceutically acceptable 
carrier. 

Another aspect of the invention is isolated nucleic acids encoding the anti-TAT antibodies. Nucleic 
acids encoding both the H and L chains and especially the hypervariable region residues, chains which encode 
the native sequence antibody as well as variants, modifications and humanized versions of the antibody, are 
encompassed. 

The invention also provides methods useful for treating a TAT polypeptide-expressing cancer or 
alleviating one or more symptoms of the cancer in a mammal, comprising administering a therapeutically 
effective amount of an anti-TAT antibody, oligopeptide or organic molecule to the mammal. The antibody, 
oligopeptide or organic molecule therapeutic compositions can be administered short term (acute) or chronic, 
or intermittent as directed by physician. Also provided are methods of inhibiting the growth of, and killing a 
TAT polypeptide-expressing cell. 

The invention also provides kits and articles of manufacture comprising at least one anti-TAT antibody, 
oligopeptide or organic molecule. Kits containing anti-TAT antibodies, oligopeptides or organic molecules find 
use, e.g. , for TAT cell killing assays, for purification or immunoprecipitation of TAT polypeptide from cells. 
For example, for isolation and purification of TAT, the kit can contain an anti-TAT antibody, oligopeptide or 
organic molecule coupled to beads (e.g., sepharose beads). Kits can be provided which contain the antibodies, 
oligopeptides or organic molecules for detection and quantitation of TAT in vitro , e.g., in an ELISA or a 
Western blot. Such antibody, oligopeptide or organic molecule useful for detection may be provided with a 
label such as a fluorescent or radiolabel. 

L. Articles of Manufacture and Kits 

Another embodiment of the invention is an article of manufacture containing materials useful for the 
treatment of anti-TAT expressing cancer. The article of manufacture comprises a container and a label or 
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package insert on or associated with the container. Suitable containers include, for example, bottles, vials, 
syringes, etc. The containers may be formed from a variety of materials such as glass or plastic. The container 
holds a composition which is effective for treating the cancer condition and may have a sterile access port (for 
example the container may be an intravenous solution bag or a vial having a stopper pierceable by a hypodermic 
injection needle). At least one active agent in the composition is an anti-TAT antibody, oligopeptide or organic 
5 molecule of the invention. The label or package insert indicates that the composition is used for treating cancer. 
The label or package insert will further comprise instructions for adtninistering the antibody, oligopeptide or 
organic molecule composition to the cancer patient. Additionally, the article of manufacture may further 
comprise a second container comprising a pharmaceutically-acceptable buffer, such as bacteriostatic water for 
injection (BWFI), phosphate-buffered saline, Ringer's solution and dextrose solution. It may further include 
10 other materials desirable from a commercial and user standpoint, including other buffers, diluents, filters, 
needles, and syringes. 

Kits are also provided that are useful for various purposes , e.g., for TAT-expressing cell killing 
assays, for purification or immunoprecipitation of TAT polypeptide from cells. For isolation and purification 
of TAT polypeptide, the kit can contain an anti-TAT antibody, oligopeptide or organic molecule coupled to 

15 beads (e.g., sepharose beads). Kits can be provided which contain the antibodies, oligopeptides or organic 
molecules for detection and quantitation of TAT polypeptide in vitro, e.g. , in an ELISA or a Western blot. As 
with the article of manufacture, the kit comprises a container-and a label or package insert on or associated with 
the container. The container holds a composition comprising at least one anti-TAT antibody, oligopeptide or 
organic molecule of die invention. Additional containers may be included that contain, e.g., diluents and 

20 buffers, control antibodies. The label or package insert may provide a description of the composition as well 
as instructions for the intended in vitro or diagnostic use. 

M. Uses for TAT Polypeptides and TAT-Polypeptide Encoding Nucleic Acids 
Nucleotide sequences (or their complement) encoding TAT polypeptides have various applications in 
the art of molecular biology, including uses as hybridization probes, in chromosome and gene mapping and in 

25 the generation of anti-sense RNA and DNA probes. TAT-encoding nucleic acid will also be useful for die 
preparation of TAT polypeptides by the recombinant techniques described herein, wherein those TAT 
polypeptides may find use, for example, in the preparation of anti-TAT antibodies as described herein. 

The full-length native sequence TAT gene, or portions thereof, may be used as hybridization probes 
for a cDNA library to isolate the fulHength TAT cDNA or to isolate still other cDNAs (for instance, those 

30 encoding naturally-occurring variants of TAT or TAT from other species) which have a desired sequence 
identity to the native TAT sequence disclosed herein. Optionally, the length of the probes will be about 20 to 
about 50 bases. The hybridization probes may be derived from at least partially novel regions of the full length 
native nucleotide sequence wherein those regions may be determined without undue experimentation or from 
genomic sequences including promoters, enhancer elements and introns of native sequence TAT. By way of 

3 5 example, a screening method will comprise isolating the coding region of the TAT gene using the known DNA 
sequence to synthesize a selected probe of about 40 bases. Hybridization probes may be labeled by a variety 
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of labels, including radionucleotides such as 3 P or 35 S , or enzymatic labels such as alkaline phosphatase coupled 
to the probe via avidin/biotin coupling systems. Labeled probes having a sequence complementary to that of 
the TAT gene of the present invention can be used to screen libraries of human cDNA, genomic DNA or mRNA 
to determine which members of such libraries the probe hybridizes to. Hybridization techniques are described 
in further detail in the Examples below. Any EST sequences disclosed in the present application may similarly 
be employed as probes, using the methods disclosed herein. 

Other useful fragments of the TAT-encoding nucleic acids include antisense or sense oligonucleotides 
comprising a singe-stranded nucleic acid sequence (either RNA or DNA) capable of binding to target TAT 
mRNA (sense) or TAT DNA (antisense) sequences. Antisense or sense oligonucleotides, according to the 
present invention, comprise a fragment of the coding region of TAT DNA. Such a fragment generally comprises 
at least about 14 nucleotides, preferably from about 14 to 30 nucleotides. The ability to derive an antisense or 
a sense oligonucleotide, based upon a cDNA sequence encoding a given protein is described in, for example, 
Stein and Cohen ( Cancer Res. 48:2659, 1988) and van der Krol et al. ( BioTechniques 6:958, 1988). 

Binding of antisense or sense oligonucleotides to target nucleic acid sequences results in,the formation 
of duplexes that block: transcription or translation of the target sequence by one of several means, including 
enhanced degradation of the duplexes, premature termination of transcription or translation, or by other means. 
Such methods are encompassed by the present invention. The antisense oligonucleotides thus may be used to 
block expression of TAT proteins, wherein those TAT proteins may play a role in the induction of cancer in 
mammals. Antisense or sense oligonucleotides further comprise oligonucleotides having modified sugar- 
phosphodiester backbones (or other sugar linkages, such as those described in WO 91/06629) and wherein such 
sugar linkages are resistant to endogenous nucleases. Such oligonucleotides with resistant sugar linkages are 
stable in vivo (i.e. , capable of resisting enzymatic degradation) but retain sequence specificity to be able to bind 
to target nucleotide sequences. 

Preferred intragenic sites for antisense binding include the region incorporating the translation 
initiation/start codon (5'-AUG / 5'-ATG) or termination/stop codon (5*-UAA, 5'-UAG and 5-UGA / 5'-TAA, 
5'-TAG and 5'-TGA) of the open reading frame (ORF) of the gene. These regions refer to a portion of the 
mRNA or gene that encompasses from about 25 to about 50 contiguous nucleotides in either direction (i.e., 5* 
or 3') from a translation initiation or termination codon. Other preferred regions for antisense binding include: 
introns; exons; intron-exon junctions; the open reading frame (ORF) or "coding region," which is the region 
between the translation initiation codon and the translation termination codon; the 5 1 cap of an mRNA which 
comprises an N7-methylated guanosine residue joined to the 5* -most residue of the mRNA via a 5' -5* 
triphosphate linkage and includes 5' cap structure itself as well as the first 50 nucleotides adjacent to the cap; 
the 5' untranslated region (5'UTR), the portion of an mRNA in die 5' direction from the translation initiation 
codon, and thus including nucleotides between the 5* cap site and the translation initiation codon of an mRNA 
or corresponding nucleotides on the gene; and the 3* untranslated region (3'UTR), the portion of an mRNA in 
the 3* direction from the translation termination codon, and thus including nucleotides between the translation 
termination codon and 3* end of an mRNA or corresponding nucleotides on the gene. 
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Specific examples of preferred antisense compounds useful for inhibiting expression of TAT proteins 
include oligonucleotides containing modified backbones or non-natural internucleoside linkages. Oligonucleotides 
having modified backbones include those that retain a phosphorus atom in the backbone and those that do not 
have a phosphorus atom in the backbone. For the purposes of this specification, and as sometimes referenced 
in the art, modified oligonucleotides that do not have a phosphorus atom in their internucleoside backbone can 
5 also be considered to be oligonucleosides. Preferred modified oligonucleotide backbones include, for example, 
phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, aminoalkylphosphotri-esters, 
methyl and other alkyl phosphonates including 3'-alkylene phosphonates, S'-alkylene phosphonates and chiral 
phosphonates, phosphinates, phosphoramidates including S'-amino phosphoramidate and 
aminoalkylphosphoramidates, thionophosphoramidates, thionoaiky lphosphonates , thionoalkylphosphotriesters, 

10 selenophosphates and borano-phosphates having normal 3'-5' linkages, 2'-5' linked analogs of these, and those 
having inverted polarity wherein one or more internucleotide linkages is a 3' to 3', 5 T to 5 f or 2' to T linkage. 
Preferred oligonucleotides having inverted polarity comprise a single 3* to 3' linkage at the 3 '-most 
internucleotide linkage i.e. a single inverted nucleoside residue which may be abasic (the nucleobase is missing 
or has a hydroxyl group in place thereof). Various salts, mixed salts and free acid forms are also included. 

15 Representative United States patents that teach the preparation of phosphorus-containing linkages include, but 
are not limited to, U.S. Pat. Nos.: 3,687,808; 4,469,863; 4,476,301; 5,023,243; 5,177,196; 5,188,897; 
5,264,423; 5,276,019; 5,278,302; 5,286,717; 5,321,131; 5,399,676; 5,405,939; 5,453,496; 5,455,233; 
5,466,677; 5,476,925; 5,519,126; 5,536,821; 5,541,306; 5,550,111; 5,563,253; 5,571,799; 5,587,361; 
5,194,599; 5,565,555; 5,527,899; 5,721,218; 5,672,697 and 5,625,050, each of which is herein incorporated 

20 by reference. 

Preferred modified oligonucleotide backbones that do not include a phosphorus atom therein have 
backbones that are formed by short chain alkyl or cycloalkyl internucleoside linkages, mixed heteroatom and 
alkyl or cycloalkyl internucleoside linkages, or one or more short chain heteroatomic or heterocyclic 
internucleoside linkages. These include those having morpholino linkages (formed in part from the sugar portion 

25 of a nucleoside); siloxane backbones; sulfide, sulfoxide and sulfone backbones; formacetyl and thioformacetyl 
backbones; methylene formacetyl and thioformacetyl backbones; riboacetyi backbones; alkene containing 
backbones; sulfamate backbones; memyleneimino and memylenehydrazino backbones; sulfonate and sulfonamide 
backbones; amide backbones; and others having mixed N, O, S and CH.sub.2 component parts. Representative 
United States patents that teach the preparation of such oligonucleosides include, but are not limited to,. U.S. 

30 Pat. Nos.: 5,034,506; 5,166,315; 5,185,444; 5,214,134; 5,216,141; 5,235,033; 5,264,562; 5,264,564; 
5,405,938; 5,434,257; 5,466,677; 5,470,967; 5,489,677; 5,541,307; 5,561,225; 5,596,086; 5,602,240; 
5,610,289; 5,602,240; 5,608,046; 5,610,289; 5,618,704; 5,623,070; 5,663,312; 5,633,360; 5,677,437; 
5,792,608; 5,646,269 and 5,677,439, each of which is herein incorporated by reference. 

In other preferred antisense oligonucleotides, both the sugar and the internucleoside linkage, i.e., the 

3 5 backbone, of the nucleotide units are replaced with novel groups. The base units are maintained for hybridization 
with an appropriate nucleic acid target compound. One such oligomeric compound, an oligonucleotide mimetic 
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that has been shown to have excellent hybridization properties, is referred to as a peptide nucleic acid (PNA). 
In PNA compounds, the sugar-backbone of an oligonucleotide is replaced with an amide containing backbone, 
in particular an aminoethylglycine backbone. The nucleobases are retained and are bound directly or indirectly 
to aza nitrogen atoms of the amide portion of the backbone. Representative United States patents that teach the 
preparation of PNA compounds include, but are not limited to, U.S. Pat. Nos.: 5,539,082; 5,714,331; and 
5 5,719,262, eachofwhich is herein incorporated by reference. Further teaching of PNA compounds can be found 
in Nielsen et al., Science, 1991, 254, 1497-1500. 

Preferred antisense oligonucleotides incorporate phosphorothioate backbones and/or heteroatom 
backbones, and in particular -CHa-NH-O-CIV, -CH2-N(CH 3 )-0-CH 2 - [known as a methylene (methylimino) 
or MMI backbone], -CHrO-NCCH^-CH^ -CH r N(CH 3 )-N(CH 3 )-CH r and -O-NCCH^H^H,- [wherein 
10 the native phosphodiester backbone is represented as -0-P-0-CH 2 ] described in the above referenced U.S. Pat. 
No. 5,489,677, and the amide backbones of the above referenced U.S. Pat. No. 5,602,240. Also preferred are 
antisense oligonucleotides having morpholino backbone structures of the above-referenced U.S. Pat. No. 
5,034,506. 

15 Modified oligonucleotides may also contain one or more substituted sugar moieties. Preferred 

oligonucleotides comprise one of the following at the 2' position: OH; F; O-alkyl, S-alkyl, or N-alkyl; O- 
alkenyl, S-alkeynyl, or N-alkenyl; O-alkynyl, S-alkynyl or N-alkynyl; or O-alkyl-O-alkyi, wherein the alkyl, 
alkenyl and alkynyl may be substituted or unsubstituted C , to C I0 alkyl or to C I0 alkenyl and alkynyl. 
Particularly preferred are 0[<CH^O] m CH 3 , 0(CB^ n OCH 3t OCCH^NH^ 0(CU£ a CH 3 , OCCH^ONH^ and 

20 OCCH^pNKCiy^H^, where n and m are from 1 to about 10. Other preferred antisense oligonucleotides 
comprise one of the following at die T position: Cj to C l0 lower alkyl, substituted lower alkyl, alkenyl, alkynyl, 
alkaryU aralkyl, O-alkaryl or O-aralkyl, SH, SCH 3 , OCN, Q, Br, CN, CF 3 , OCF 3 , SOCH3, S0 2 CH 3 , ON0 2 , 
N0 2 , N 3 , NHj, heterocycloalkyl, heterocycloalkaryl, aminoalkylamino, polyalkylamino, substituted silyl, an 
RNA cleaving group, a reporter group, an intercalated, a group for improving the pharmacokinetic properties 

25 of an oligonucleotide, or a group for improving the pharmacodynamic properties of an oligonucleotide, and 
other substituents having similar properties. A preferred modification includes 2'-methoxyethoxy 
^-O-CHjC^OCH^also^ Chim. Acta, 1995, 

78, 486-504) i.e., an alkoxyalkoxy group. A further preferred modification includes 
2'-<iim^ylaminooxyetiioxy, i.e., aO(CH ^ONfCH^ group, also known as 2'DMAOE, as described in 

30 examples hereinbelow, and 2 , -dimethyIammoethoxyethoxy (also known in the art as 
2'-0-dimethylaminoe<hoxyethylor2 , -DMAEOE), i.e., 2'-0-CH r O-CH 2 -N(CH 2 ). 

A further prefered modification includes Locked Nucleic Acids (LNAs) in which the 2 T -hydroxy 1 group 
is linked to the 3' or 4* carbon atom of tiie sugar ring thereby forming a bicyclic sugar moiety. The linkage is 
preferably a methelyne (-CH 2 -) n group bridging the T oxygen atom and the 4' carbon atom wherein n is 1 or 
35 2. LNAs and preparation thereof are described in WO 98/39352 and WO 99/14226. 

Other preferred modifications include 2'-methoxy (2'-0-CH 3 ), 2'-aminopropoxy (2 , -OCH 2 CH 2 CH 2 
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NH2X 2'-allyl (2 , -CH 2 -CH=CH 2 ), 2'-0-allyl (2 , -0-CH 2 -CH=CH 2 ) and 2'-fluoro (2'-F). The 2'-modification 
may be in the arabino (up) position or ribo (down) position. A preferred 2 '-arabino modification is T -F. Similar 
modifications may also be made at other positions on the oligonucleotide, particularly the 3 ' position of the sugar 
on the 3' terminal nucleotide or in 2 ? -5* linked oligonucleotides and the 5' position of 5 1 terminal nucleotide. 
Oligonucleotides may also have sugar mimetics such as cyclobutyl moieties in place of the pentofuranosyl sugar. 
5 Representative United States patents that teach the preparation of such modified sugar structures include, but 
are not limited to, U.S. Pat. Nos.: 4,981,957; 5,118,800; 5,319,080; 5,359,044; 5,393,878; 5,446,137; 
5,466,786; 5,514,785; 5,519,134; 5,567,811; 5,576,427; 5,591,722; 5,597,909; 5,610,300; 5,627,053; 
5,639,873; 5,646,265; 5,658,873; 5,670,633; 5,792,747; and 5,700,920, each of which is herein incorporated 
by reference in its entirety. 

10 Oligonucleotides may also include nucleobase (often referred to in the art simply as "base") 

modifications or substitutions. As used herein, "unmodified" or "natural" nucleobases include the purine bases 
adenine (A) and guanine (G), and the pyrimidine bases thymine (T), cytosine (Q and uracil (U). Modified 
nucleobases include other synthetic and natural nucleobases such as 5-methyicytosine (5-me-Q, 
5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of 

15 adenine and guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine 
and 2-thiocytosine, 5-halouracil and cytosine, 5-propynyl (-C^C-CH 3 or -CH 2 -C^CH) uracil and cytosine and 
other alkynyl derivatives of pyrimidine bases, 6-azo uracil, cytosine and thymine, 5-uracil (pseudouracil), 

4- thiouracil, 8-halo, 8-amino, 8-thiol, 8-thioalkyl, 8-hydroxyl and other 8-substituted adenines and guanines, 

5- halo particularly 5-bromo, 5-trifluoromethyl and other 5-substituted uracils and cytosines, 7-methylguanine 
20 and 7-methyladenine, 2-F-adenine, 2-amino-adenine, 8-azaguanine and 8-azaadenine, 7-deazaguanine and 

7-deazaadenine and 3-deazaguanine and 3-deazaadenine. Further modified nucleobases include tricyclic 
pyrimidines such as phenoxazine cytidme(lH-pyrimido[5,44)][l,4]benzoxazin-2(3H)-one), phenothiazine 
cytidine (lH-pyrimido[5,4-b][l ,4]benzothiazin-2(3H>-one), G-clamps such as asubstituted phenoxazine cytidine 
(e.g. 9K2-aminoethoxy)-H-pyrimido[5,4-b][l,4]benzoxazin-2(3H)-one), carbazole cytidine 

25 (2H-pyriinido[4,5-b]indol-2-one), pyridoindole cytidine (H-pyridoP\2^4,5]pyirolo[2,3^]pyrimidin-2-one). 
Modified nucleobases may also include those in which the purine or pyrimidine base is replaced with other 
heterocycles, for example 7-deaza-adenine, 7-deazaguanosine, 2-aminopyridine and 2-pyridone. Further 
nucleobases include those disclosed in U.S. Pat. No, 3,687,808, those disclosed in The Concise Encyclopedia 
Of Polymer Science And Engineering, pages 858-859, Kroschwitz, J. I., ed. John Wiley & Sons, 1990, and 

30 those disclosed by Englisch et ah, Angewandte Chemie, International Edition, 1991, 30, 613. Certain of these 
nucleobases are particularly useful for increasing the binding affinity of the oligomeric compounds of the 
invention. These include 5-substituted pyrimidines, 6-azapyrimidines and N-2, N-6 and 0-6 substituted purines, 
including 2-aminopropyladenine, 5-propynyluracil and 5-propynylcytosine. 5-methylcytosine substitutions have 
been shown to increase nucleic acid duplex stability by 0.6-1.2.dpgree. C. (Sanghvi et al, Antisense Research 

35 and Applications, CRC Press, Boca Raton, 1993, pp. 276-278) and are preferred base substitutions, even more 
particularly when combined with 2 , -0-methoxyethyl sugar modifications. Representative United States patents 
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that teach the preparation of modified nucleobases include, but are not limited to: U.S. Pat. No. 3,687,808, as 
well as U.S. Pat. Nos.: 4,845,205; 5,130,302; 5,134,066; 5,175,273; 5,367,066; 5,432,272; 5,457,187; 
5,459,255; 5,484,908; 5,502,177; 5,525,711; 5,552,540; 5,587,469; 5,594,121, 5,596,091; 5,614,617; 
5,645,985; 5,830,653; 5,763,588; 6,005,096; 5,681,941 and 5,750,692, each of which is herein incoiporated 
by reference. 

5 Another modification of antisense oligonucleotides chemically linking to the oligonucleotide one or 

more moieties or conjugates which enhance the activity, cellular distribution or cellular uptake of the 
oligonucleotide. The compounds of the invention can include conjugate groups covalently bound to functional 
groups such as primary or secondary hydroxyl groups. Conjugate groups of the invention include intercalators , 
reporter molecules, poiyamines, polyamides, polyethylene glycols, polyethers, groups that enhance the 

10 pharmacodynamic properties of oligomers, and groups that enhance the pharmacokinetic properties of 

oligomers. Typical conjugates groups include cholesterols, lipids, cation lipids, phospholipids, cationic 
phospholipids, biotin, phenazine, folate, phenanthridine, anthraquinone, acridine, fluoresceins, rhodamines, 
coumarins, and dyes. Groups that enhance the pharmacodynamic properties, in the context of this invention, 
include groups that improve oligomer uptake, enhance oligomer resistance to degradation, and/or strengthen 

1 5 sequence-specific hybridization with RNA. Groups that enhance the pharmacokinetic properties, in the context 
of this invention, include groups that improve oligomer uptake, distribution, metabolism or excretion. Conjugate 
moieties include but are not limited to lipid moieties such as a cholesterol moiety (Letsinger et al., Proc. Natl. 
Acad. Sci. USA, 1989, 86, 6553-6556), cholic acid (Manoharan et al., Bioorg. Med. Chem. Let., 1994, 4, 
1053-1060), a thioether, e.g., hexyl-S-tritylthiol (Manoharan et al., Ann. N. Y. Acad. Sci. , 1992, 660, 306-309; 

20 Manoharan et al., Bioorg. Med. Chem. Let., 1993, 3, 2765-2770), a thiocholesterol (Oberhauser et al., Nucl. 
Acids Res., 1992, 20, 533-538), an aliphatic chain, e.g., dodecandiol or undecyl residues (Saison-Behmoaras 
et al., EMBO J., 1991, 10, 1111-1118; Kabanov et al., FEBS Lett., 1990, 259, 327-330; Svinarchuk et al., 
Biochimie, 1993, 75, 49-54), a phospholipid, e.g., di-hexadecyl-rac-glycerol or triethyl-ammonium 
1 ,2^-0-hexadecyl-rac-glycero-3-H-phosphonate (Manoharan et al., Tetrahedron Lett., 1995, 36, 3651-3654; 

25 Shea et al., Nucl. Acids Res., 1990, 18, 3777-3783), a polyamine or a polyethylene glycol chain (Manoharan 
et al., Nucleosides 8c Nucleotides, 1995, 14, 969-973), or adamantane acetic acid (Manoharan et al., 
Tetrahedron Lett., 1995, 36, 3651-3654), a palmityl moiety (Mishra et al., Biochim. Biophys. Acta, 1995, 
1264, 229-237), or an octadecylamine or hexylamino-carbonyl-oxycholesterol moiety. Oligonucleotides of the 
invention may also be conjugated to active drug substances, for example, aspirin, warfarin, phenylbutazone, 

30 ibuprofen, suprofen, feribufen, ketoprofen, (SM+)-pranoprofen, carprofen, dansylsarcosine, 

2,3,5-triiodobenzoic acid, flufenamic acid, folinic acid, a benzothiadiazide, chlorothiazide, a diazepine, 
indomethicin, a barbiturate, a cephalosporin, a sulfa drug, an antidiabetic, an antibacterial or an antibiotic. 
Oligonucleotide-drug conjugates and their preparation are described in U.S. patent application Ser. No. 
09/334, 130 (filed Jun. 15, 1999) and United States patents Nos.: 4,828,979; 4,948,882; 5,218,105; 5,525,465; 

35 5,541,313; 5,545,730; 5,552,538; 5,578,717, 5,580,731; 5,580,731; 5,591,584; 5,109,124; 5,118,802; 
5,138,045; 5,414,077; 5,486,603; 5,512,439; 5,578,718; 5,608,046; 4,587,044; 4,605,735; 4,667,025; 
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4,762,779; 4,789,737; 4,824,941; 4,835,263; 4,876,335; 4,904,582; 4,958,013; 5,082,830; 5,112,963; 
5,214,136; 5,082,830; 5,112,963; 5,214,136; 5,245,022; 5,254,469; 5,258,506; 5,262,536; 5,272,250; 
5,292,873; 5,317,098; 5,371,241, 5,391,723; 5,416,203, 5,451,463; 5,510,475; 5,512,667; 5,514,785; 
5,565,552; 5,567,810; 5,574,142; 5,585,481; 5,587,371; 5,595,726; 5,597,696; 5,599,923; 5,599,928 and 
5,688,941, each of which is herein incorporated by reference. 

It is not necessary for all positions in a given compound to be uniformly modified, and in fact more 
than one of the aforementioned modifications may be incorporated in a single compound or even at a single 
nucleoside within an oligonucleotide. The present invention also includes antisense compounds which are 
chimeric compounds. "Chimeric" antisense compounds or "chimeras/' in the context of mis invention, are 
antisense compounds, particularly oligonucleotides, which contain two or more chemically distinct regions, each 
made up of at least one monomer unit, i.e., a nucleotide in the case of an oligonucleotide compound. These 
oligonucleotides typically contain at least one region wherein the oligonucleotide is modified so as to confer upon 
the oligonucleotide increased resistance to nuclease degradation, increased cellular uptake, and/or increased 
binding affinity for the target nucleic acid. An additional region of the oligonucleotide may serve as a substrate 
for enzymes capable of cleaving RNArDNA or RNArRNA hybrids. By way of example, RNase H is a cellular 
endonuclease which cleaves the RNA strand of an RNA:DNA duplex. Activation of RNase H, therefore, results 
in cleavage of the RNA target, thereby greatly enhancing the efficiency of oligonucleotide inhibition of gene 
expression. Consequently, comparable results can oftenbe obtained with shorter oligonucleotides when chimeric 
oligonucleotides are used, compared to phosphorothioate deoxyoligonucleotides hybridizing to the same target 
region. Chimeric antisense compounds of die invention may be formed as composite structures of two or more 
oligonucleotides, modified oligonucleotides, oligonucleosides andVor oligonucleotide mimetics as described 
above. Preferred chimeric antisense oligonucleotides incorporate at least one T modified sugar (preferably 
2'-0-(CH 2 ) 2 -0-CH3) at the 3 ' terminal to confer nuclease resistance and a region with at least 4 contiguous 2 ? -H 
sugars to confer RNase H activity. Such compounds have also been referred to in the art as hybrids or gapmers. 
Preferred gapmers have a region of 2' modified sugars (preferably 2 , -0-(CH 2 ) 2 -0-CH 3 ) at the 3' -terminal and 
at the 5* terminal separated by at least one region having at least 4 contiguous 2'-H sugars and preferably 
incorporate phosphorothioate backbone linkages. Representative United States patents that teach the preparation 
of such hybrid structures include, but are not limited to, U.S. Pat. Nos. 5,013,830; 5,149,797; 5,220,007; 
5,256,775; 5,366,878; 5,403,711; 5,491,133; 5,565,350; 5,623,065; 5,652,355; 5,652,356; and 5,700,922, 
each of which is herein incorporated by reference in its entirety. 

The antisense compounds used in accordance with this invention may be conveniently and routinely 
made through the well-known technique of solid phase synthesis. Equipment for such synthesis is sold by 
several vendors including, for example, Applied Biosystems (Foster City, Calif.). Any other means for such 
synthesis known in the art may additionally or alternatively be employed. It is well known to use similar 
techniques to prepare oligonucleotides such as the phosphorothioates and alkylated derivatives. The compounds 
of the invention may also be admixed, encapsulated, conjugated or otherwise associated with other molecules, 
molecule structures or mixtures of compounds, as for example, liposomes, receptor targeted molecules, oral, 
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rectal, topical or other formulations, for assisting in uptake, distribution and/or absorption. Representative 
United States patents that teach the preparation of such uptalce, distribution and/or absorption assisting 
formulations include, but are not limited to, U.S. Pat. Nos. 5,108,921; 5,354,844; 5,416,016; 5,459,127; 
5,521,291; 5,543,158; 5,547,932; 5,583,020; 5,591,721; 4,426,330; 4,534,899; 5,013,556; 5,108,921; 
5,213,804; 5,227,170; 5,264,221; 5,356,633; 5,395,619; 5,416,016; 5,417,978; 5,462,854; 5,469,854; 
5,512,295; 5,527,528; 5,534,259; 5,543,152; 5,556,948; 5,580,575; and 5,595,756, each of which is herein 
incorporated by reference. 

Other examples of sense or antisense oligonucleotides include those oligonucleotides which are 
covalently linked to organic moieties, such as those described in WO 90/10048, and other moieties that increases 
affinity of the oligonucleotide for a target nucleic acid sequence, such as poly-(L-lysine). Further still, 
intercalating agents, such as eilipticine, and alkylating agents or metal complexes may be attached to sense or 
antisense oligonucleotides to modify binding specificities of the antisense or sense oligonucleotide for the target 
nucleotide sequence. 

Antisense or sense ohgonucleotides may be introduced into a cell containing the target nucleic acid 
sequence by any gene transfer method, including, for example, CaPO 4 -mediated DNA transfection, 
electroporation, or by using gene transfer vectors such as Epstein-Barr virus. In a preferred procedure, an 
antisense or sense oligonucleotide is inserted into a suitable retroviral vector. A cell containing the target 
nucleic acid sequence is contacted with the recombinant retroviral vector, either in vivo or ex vivo. Suitable 
retroviral vectors include, but are not limited to, those derived from the murine retrovirus M-MuLV, N2 (a 
retrovirus derived from M-MuLV), or the double copy vectors designated DCT5A, DCT5B and DCT5C (see 
WO 90/13641). 

Sense or antisense oligonucleotides also may be introduced into a cell containing the target nucleotide 
sequence by formation of a conjugate with a ligand binding molecule, as described in WO 91/04753. Suitable 
ligand binding molecules include, but are not limited to, cell surface receptors, growth factors, other cytokines, 
or other ligands that bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule does 
not substantially interfere with the ability of the ligand binding molecule to bind to its corresponding molecule 
or receptor, or block entry of the sense or antisense oligonucleotide or its conjugated version into the cell. 

Alternatively, a sense or an antisense oligonucleotide may be introduced into a cell containing the target 
nucleic acid sequence by formation of an oligonucleotide-lipid complex, as described in WO 90/10448. The 
sense or antisense oligonucleotide-lipid complex is preferably dissociated within the cell by an endogenous 
lipase. 

Antisense or sense RNA or DNA molecules are generally at least about 5 nucleotides in length, 
alternatively at least about 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 
28, 29, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 105, 110, 115, 120, 125, 130, 135, 140, 
145, 150, 155, 160, 165, 170, 175, 180, 185, 190, 195, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 
300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 450, 460, 470, 480, 490, 500, 
510, 520, 530, 540, 550, 560, 570, 580, 590, 600, 610, 620, 630, 640, 650, 660, 670, 680, 690, 700, 710, 
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720, 730, 740, 750, 760, 770, 780, 790, 800, 810, 820, 830, 840, 850, 860, 870, 880, 890, 900, 910, 920, 
930, 940, 950, 960, 970, 980, 990, or 1000 nucleotides in length, wherein in tins context the term "about* 
means the referenced nucleotide sequence length plus or minus 10% of that referenced length. 

The probes may also be employed in PCR techniques to generate a pool of sequences for identification 
of closely related TAT coding sequences. 
5 ' Nucleotide sequences encoding a TAT can also be used to construct hybridization probes for mapping 

the gene which encodes that TAT and for the genetic analysis of individuals with genetic disorders. The 
nucleotide sequences provided herein may be mapped to a chromosome and specific regions of a chromosome 
using known techniques, such as in situ hybridization, linkage analysis against known chromosomal markers, 
and hybridization screening with libraries. 

1 0 When the coding sequences for TAT encode a protein which binds to another protein (example, where 

the TAT is a receptor), the TAT can be used in assays to identify the other proteins or molecules involved in 
the binding interaction. By such methods, inhibitors of the receptor/ligand binding interaction can be identified. 
Proteins involved in such binding interactions can also be used to screen for peptide or small molecule inhibitors 
or agonists of the binding interaction. Also, the receptor TAT can be used to isolate correlative ligand(s). 

1 5 Screening assays can be designed to find lead compounds that mimic the biological activity of a native TAT or 
a receptor for TAT. Such screening assays will include assays amenable to high-throughput screening of 
chemical libraries, making them particularly suitable for identifying small molecule drug candidates. Small 
molecules contemplated include synthetic organic or inorganic compounds. The assays can be performed in a 
variety of formats, including protein-protein binding assays, biochemical screening assays, immunoassays and 

20 cell based assays, which are well characterized in the art. 

Nucleic acids which encode TAT or its modified forms can also be used to generate either transgenic 
animals or "knock out" animals which, in turn, are useful in the development and screening of therapeutically 
useful reagents. A transgenic animal (e.g., a mouse or rat) is an animal having cells that contain a transgene, 
which transgene was introduced into the animal or an ancestor of the animal at a prenatal, e.g., an embryonic 

25 stage. A transgene is a DNA which is integrated into the genome of a cell from which a transgenic animal 
develops. In one embodiment, cDNA encoding TAT can be used to clone genomic DNA encoding TAT in 
accordance with established techniques and the genomic sequences used to generate transgenic animals that 
contain cells which express DNA encoding TAT. Methods for generating transgenic animals, particularly 
animals such as mice or rats, have become conventional in the art and are described, for example, in U.S. 

30 Patent Nos. 4,736,866 and 4,870,009. Typically, particular cells would be targeted for TAT transgene 

incorporation with tissue-specific enhancers. Transgenic animals that include a copy of a transgene encoding 
TAT introduced into the germ line of the animal at an embryonic stage can be used to examine the effect of 
increased expression of DNA encoding TAT. Such animals can be used as tester animals for reagents thought 
to confer protection from, for example, pathological conditions associated with its overexpression. In 

35 accordance with this facet of the invention, an animal is treated with the reagent and a reduced incidence of die 
pathological condition, compared to untreated animals bearing the transgene, would indicate a potential 
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therapeutic intervention for the pathological condition. 

Alternatively, non-human homologies of TAT can be used to construct a TAT "knock out" animal 
which has a defective or altered gene encoding TAT as a result of homologous recombination between the 
endogenous gene encoding TAT and altered genomic DNA encoding TAT introduced into an embryonic stem 
cell of the animal. For example, cDNA encoding TAT can be used to clone genomic DNA encoding TAT in 
accordance with established techniques. A portion of the genomic DNA encoding TAT can be deleted or 
replaced with another gene, such as a gene encoding a selectable marker which can be used to monitor 
integration. Typically, several kUobases of unaltered flanking DNA (both at the 5* and 3' ends) are included 
in the vector [see e.g., Thomas and Capecchi, CeJL 51:503 (1987) for a description of homologous 
recombination vectors]. The vector is introduced into an embryonic stem cell line (e.g., by electroporation) 
and cells in which the introduced DNA has homologously recombined with the endogenous DNA are selected 
[see e.g., Ii et al., Cell, 69:915 (1992)]. The selected cells are then injected into a blastocyst of an animal 
(e.g., a mouse or rat) to form aggregation chimeras [see e.g., Bradley, in Teratocarcinomas and Embryonic 
Stem Cells: A Practical Approach, E. J. Robertson, ed. (IRL, Oxford, 1987), pp. 113-152]. A chimeric embryo 
can then be implanted into a suitable pseudopregnant female foster animal and the embryo brought to term to 
create a "knock out" animal. Progeny harboring the homologously recombined DNA in their germ cells can 
be identified by standard techniques and used to breed animals in which all cells of die animal contain the 
homologously recombined DNA. Knockout animals can be characterized for instance, for their ability to defend 
against certain pathological conditions and for their development of pathological conditions due to absence of 
the TAT polypeptide. 

Nucleic acid encoding the TAT polypeptides may also be used in gene therapy. In gene therapy 
applications, genes are introduced into cells in order to achieve in vivo synthesis of a therapeutically effective 
genetic product, for example for replacement of a defective gene. "Gene therapy" includes both conventional 
gene therapy where a lasting effect is achieved by a single treatment, and the administration of gene therapeutic 
agents, which involves the one time or repeated administration of a therapeutically effective DNA or mRNA. 
Antisense RNAs and DNAs can be used as therapeutic agents for blocking the expression of certain genes in 
vivo. It has already been shown that short antisense oligonucleotides can be imported into cells where they act 
as inhibitors, despite their low intracellular concentrations caused by their restricted uptake by the cell 
membrane. (Zamecnikef a/., Pmc. Nad. Acad, gci. USA 83:4143-4146 [1986]). The oligonucleotides can be 
modified to enhance their uptake, e.g. by substituting their negatively charged phosphodiester groups by 
uncharged groups. . 

There are a variety of techniques available for introducing nucleic acids into viable cells. The 
techniques vary depending upon whether die nucleic acid is transferred into cultured cells in vitro, or in vivo 
in the cells of die intended host. Techniques suitable for the transfer of nucleic acid into mammalian cells in 
vitro include the use of liposomes, electroporation, microinjection, cell fusion, DEAE-dextran, the calcium 
phosphate precipitation method, etc. The currently preferred in vivo gene transfer techniques include 
transfection with viral (typically retroviral) vectors and viral coat protein-liposome mediated transfection (Dzau 
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et al., Trends in Biotechnology 11, 205-210 [1993]). In some situations it is desirable to provide the nucleic 
acid source with an agent that targets the target cells, such as an antibody specific for a cell surface membrane 
protein or the target cell, a ligand for a receptor on the target cell, etc. Where liposomes are employed, proteins 
which bind to a cell surface membrane protein associated with endocytosis may be used for targeting and/or to 
facilitate uptake, e.g. capsid proteins or fragments thereof tropic for a particular cell type, antibodies for 
5 proteins which undergo internalization in cycling, proteins that target intracellular localization and enhance 
intracellular half-life. The technique of receptor-mediated endocytosis is described, for example, by Wu et al. , 
J. Biol. Chem. 262. 4429-4432 (1987); and Wagner et al., Proc. Natl. Acad. Sci. USA 87, 3410-3414 (1990). 
For review of gene marking and gene therapy protocols see Anderson et al., Science 256, 808-813 (1992). 

The nucleic acid molecules encoding the TAT polypeptides or fragments thereof described herein are 

1 0 useful for chromosome identification. In this regard, there exists an ongoing need to identify new chromosome 
markers, since relatively few chromosome marking reagents, based upon actual sequence data are presently 
available. Each TAT nucleic acid molecule of the present invention can be used as a chromosome marker. 

The TAT polypeptides and nucleic acid molecules of the present invention may also be used 
diagnostically for tissue typing, wherein the TAT polypeptides of the present invention may be differentially 

15 expressed in one tissue as compared to another, preferably in a diseased tissue as compared to a normal tissue 
of the same tissue type. TAT nucleic acid molecules will find use for generating probes for PGR, Northern 
analysis, Southern analysis and Western analysis. 

This invention encompasses methods of screening compounds to identify those that mimic the TAT 
polypeptide (agonists) or prevent the effect of the TAT polypeptide (antagonists). Screening assays for 

20 antagonist drug candidates are designed to identify compounds that bind or complex with the TAT polypeptides 
encoded by the genes identified herein, or otherwise interfere with the interaction of the encoded polypeptides 
with other cellular proteins, including e.g., inhibiting the expression of TAT polypeptide from cells. Such 
screening assays will include assays amenable to high-throughput screening of chemical libraries, making them 
particularly suitable for identifying small molecule drug candidates. 

25 The assays can be performed in a variety of formats, including protein-protein binding assays, 

biochemical screening assays, immunoassays, and cell-based assays, which are well characterized in the art. 

All assays for antagonists are common in that they call for contacting the drug candidate with a TAT 
polypeptide encoded by a nucleic acid identified herein under conditions and for a time sufficient to allow these 
two components to interact. 

30 In binding assays, the interaction is binding and the complex formed can be isolated or detected in the 

reaction mixture. In a particular embodiment, the TAT polypeptide encoded by the gene identified herein or 
the drug candidate is immobilized on a solid phase, e.g., on a microtiter plate, by covalent or non-covalent 
attachments. Non-covalent attachment generally is accomplished by coating the solid surface with a solution 
of the TAT polypeptide and drying. Alternatively, an immobilized antibody, e.g., a monoclonal antibody, 

35 specific for the TAT polypeptide to be immobilized can be used to anchor it to a solid surface. The assay is 
performed by adding the non-immobilized component, which may be labeled by a detectable label, to the 
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immobilized component, e.g., the coated surface containing the anchored component. When the reaction is 
complete, the non-reacted components are removed, e.g., by washing, and complexes anchored on the solid 
surface are detected. When the originally non-immobilized component carries a detectable label, the detection 
of label immobilized on the surface indicates that complexing occurred. Where the originally non-immobilized 
component does not carry a label, complexing can be detected, for example, by using a labeled antibody 
specifically binding the immobilized complex. 

If the candidate compound interacts with but does not bind to a particular TAT polypeptide encoded 
by a gene identified herein, its interaction with that polypeptide can be assayed by methods well known for 
detecting protein-protein interactions. Such assays include traditional approaches, such as, e.g. , cross-linking, 
co-immunopreeipitation, and co-purification through gradients or chromatographic columns. In addition, 
protein-protein interactions can be monitored by using a yeast-based genetic system described by Fields and co- 
workers (Fields and Song, Nature (London). 340:245-246 (1989); Chien et aL, Proc. Natl. Acad. Sci. USA, 
88:9578-9582 (1991)) as disclosed by Chevray and Nathans, Proc. Natl. Acad. Sci. USA, 89: 5789-5793 
(1991). Many transcriptional activators, such as yeast GAL4, consist of two physically discrete modular 
domains, one acting as the DNA-binding domain, the other one functioning as the transcription-activation 
domain. The yeast expression system described in the foregoing publications (generally referred to as the "two- 
hybrid system") takes advantage of this property, and employs two hybrid proteins, one in which the target 
protein is fused to the DNA-binding domain of GAL4, and another, in which candidate activating proteins are 
fused to the activation domain. The expression of a GAL1- lacZ reporter gene under control of a GAL4- 
activated promoter depends on reconstitution of GAL4 activity via protein-protein interaction. Colonies 
containing interacting polypeptides are detected with a chromogenic substrate for p-galactosidase. A complete 
kit (MATCHMAKER™) for identifying protein-protein interactions between two specific proteins using the two- 
hybrid technique is commercially available from Clontech. This system can also be extended to map protein 
domains involved in specific protein interactions as well as to pinpoint amino acid residues that are crucial for 
these interactions. 

Compounds that interfere with the interaction of a gene encoding a TAT polypeptide identified herein 
and other intra- or extracellular components can be tested as follows: usually a reaction mixture is prepared 
containing the product of die gene and the intra- or extracellular component under conditions and for a time 
allowing for the interaction and binding of the two products. To test the ability of a candidate compound to 
inhibit binding, die reaction is run in the absence and in the presence of the test compound. In addition, a 
placebo may be added to a third reaction mixture, to serve as positive control. The binding (complex formation) 
between the test compound and the intra- or extracellular component present in the mixture is monitored as 
described hereinabove. The formation of a complex in the control reactions) but not in the reaction mixture 
containing the test compound indicates that the test compound interferes with the interaction of the test 
compound and its reaction partner. 

To assay for antagonists, the TAT polypeptide may be added to a cell along with the compound to be 
screened for a particular activity and the ability of the compound to inhibit die activity of interest in the presence 
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of the TAT polypeptide indicates that lie compound is an antagonist to the TAT polypeptide. Alternatively, 
antagonists may be detected by combining the TAT polypeptide and a potential antagonist with membrane-bound 
TAT polypeptide receptors or recombinant receptors under appropriate conditions for a competitive inhibition 
assay. The TAT polypeptide can be labeled, such as by radioactivity, such that the number of TAT polypeptide 
molecules bound to the receptor can be used to determine the effectiveness of the potential antagonist. The gene 
5 encoding the receptor can be identified by numerous methods known to those of skill in the art, for example, 
ligand panning and FACS sorting. Coligan et al., Current Protocols in Immun.. 1(2): Chapter 5 (1991). 
Preferably, expression cloning is employed wherein poly adenylated RNA is prepared from a cell responsive to 
the TAT polypeptide and a cDNA library created from this RNA is divided into pools and used to transfect COS 
cells or other cells that are not responsive to the TAT polypeptide. Transfected cells that are grown on glass 

1 0 slides are exposed to labeled TAT polypeptide. The TAT polypeptide can be labeled by a variety of means 
including iodination or inclusion of a recognition site for a site-specific protein kinase. Following fixation and 
incubation, the slides are subjected to autoradiographic analysis. Positive pools are identified and sub-pools are 
prepared and re-transfected using an interactive sub-pooling and re-screening process, eventually yielding a 
single clone that encodes the putative receptor. 

15 As an alternative approach for receptor identification, labeled TAT polypeptide can be photoaffinity- 

linked with cell membrane or extract preparations that express the receptor molecule. Cross-linked material 
is resolved by PAGE and exposed to X-ray film. The labeled complex containing the receptor can be excised, 
resolved into peptide fragments, and subjected to protein micro-sequencing. The amino acid sequence obtained 
from micro- sequencing would be used to design a set of degenerate oligonucleotide probes to screen a cDNA 

20 library to identify the gene encoding the putative receptor. 

In another assay for antagonists, mammalian cells or a membrane preparation expressing die receptor 
would be incubated with labeled TAT polypeptide in the presence of the candidate compound. The ability of 
the compound to enhance or block this interaction could then be measured. 

More specific examples of potential antagonists include an oligonucleotide that binds to the fusions of 

25 immunoglobulin with TAT polypeptide, and, in particular, antibodies including, without limitation, poly- and 
monoclonal antibodies and antibody fragments, single-chain antibodies, anti-idiotypic antibodies, and chimeric 
or humanized versions of such antibodies or fragments, as well as human antibodies and antibody fragments. 
Alternatively, a potential antagonist may be a closely related protein, for example, a mutated form of the TAT 
polypeptide that recognizes the receptor but imparts no effect, thereby competitively inhibiting the action of the 

30 TAT polypeptide. 

Another potential TAT polypeptide antagonist is an antisense RNA or DNA construct prepared using 
antisense technology, where, e.g., an antisense RNA or DNA molecule acts to block directly the translation of 
mRNA by hybridizing to targeted mRNA and preventing protein translation. Antisense technology can be used 
to control gene expression through triple-helix formation or antisense DNA or RNA, both of which methods 

35 are based on binding of a polynucleotide to DNA or RNA. For example, the 5' coding portion of die 

polynucleotide sequence, which encodes the mature TAT polypeptides herein, is used to design an antisense 
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RNA oligonucleotide of from about 10 to 40 base pairs in length. A DNA oligonucleotide is designed to be 
complementary to a region of the gene involved in transcription (triple helix - see Lee et al., Nucl. Acids Res., 
6:3073 (1979); Cooney et al., Science. 241: 456 (1988); Dervan et al., Science, 251:1360 (1991)), thereby 
preventing transcription and the production of the TAT polypeptide. The antisense RNA oligonucleotide 
hybridizes to the mRNA in vivo and blocks translation of the mRNA molecule into the TAT polypeptide 
(antisense - Okano, Neurochem. . 56:560 (1991); Oligodeoxvnu cleotides as Antisense Inhibitors of Gene 
Expression (CRC Press: Boca Raton, FL, 1988). The oligonucleotides described above can also be delivered 
to cells such that the antisense RNA or DNA may be expressed in vivo to inhibit production of the TAT 
polypeptide. When antisense DNA is used, oligodeoxyribonucleotides derived from the translation-initiation 
site, e.g., between about -10 and +10 positions of the target gene nucleotide sequence, are preferred. 

Potential antagonists include small molecules that bind to the active site, the receptor binding site, or 
growth fector or other relevant binding site of the TAT polypeptide, thereby blocking the normal biological 
activity of the TAT polypeptide. Examples of small molecules include, but are not limited to, small peptides 
or peptide-like molecules, preferably soluble peptides, and synthetic non-peptidyl organic or inorganic 
compounds. 

Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of RNA. 
Ribozymes act by sequence-specific hybridization to die complementary target RNA, followed by 
endonucleolytic cleavage. Specific ribozyme cleavage sites within a potential RNA target can be identified by 
known techniques. For farther details see, e.g., Ross Current Biology. 4:469-471 (1994), and PCT publication 
No. WO 97/33551 (published September 18, 1997). 

Nucleic acid molecules in triple-helix formation used to inhibit transcription should be single-stranded 
and composed of deoxynucleotides. The base composition of these oligonucleotides is designed such that it 
promotes triple-helix formation via Hoogsteen base-pairing rules, which generally require sizeable stretches of 
purines or pyrimidines on one strand of a duplex. For further details see, e.g., PCT publication No. WO 
97/33551, supra. 

These small molecules can be identified by any one or more of die screening assays discussed 
hereinabove and/or by any other screening techniques well known for those skilled in the art 

Isolated TAT potypeptide-encoding nucleic acid can be used herein for recombinantly producing TAT 
polypeptide using techniques well known in die art and as described herein. In turn, the produced TAT 
polypeptides can be employed for generating anti-TAT antibodies using techniques well known in the art and 
as described herein. 
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Antibodies specifically binding a TAT polypeptide identified herein, as well as other molecules 
identified by the screening assays disclosed hereinbefore, can be administered for the treatment of various 
disorders, including cancer, in the form of pharmaceutical compositions. 

If the TAT polypeptide is intracellular and whole antibodies are used as inhibitors, internalizing 
antibodies are preferred. However, lipofections or liposomes can also be used to deliver the antibody, or an 
antibody fragment, into cells. Where antibody fragments are used, the smallest inhibitory fragment that 
specifically binds to the binding domain of the target protein is preferred. For example, based upon the 
variable-region sequences of an antibody, peptide molecules can be designed that retain die ability to bind the 
target protein sequence. Such peptides can be synthesized chemically and/or produced by recombinant DNA 
technology. See, e.g., Marasco etal., Proc. Natl. Acad. Sci. USA, 90: 7889-7893 (1993). 

The formulation herein may also contain more than one active compound as necessary for the particular 
indication being treated, preferably those with complementary activities that do not adversely affect each other . 
Alternatively, or in addition, the composition may comprise an agent that enhances its function, such as, for 
example, a cytotoxic agent, cytokine, chemotherapeutic agent, or growth-inhibitory agent. Such molecules are 
suitably present in combination in amounts that are effective for the purpose intended. 

The following examples are offered for illustrative purposes only, and are not intended to limit the 
scope of the present invention in any way. 

All patent and literature references cited in the present specification are hereby incorporated by 
reference in their entirety. 

EXAMPLES 

Commercially available reagents referred to in the examples were used according to manufacturer's 
instructions unless otherwise indicated. The source of those cells identified in the following examples, and 
throughout the specification, by ATCC accession numbers is the American Type Culture Collection, Manassas, 
VA. 

EXAMPLE 1: Anal ysts of Differential TAT Polypeptid e Expression by GEPIS 

An expressed sequence tag (EST) DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) 
was searched and interesting EST sequences were identified by GEPIS. Gene expression profiling in silico 
(GEPIS) is a bioinformatics tool developed at Genentech, Inc. that characterizes genes of interest for new cancer 
therapeutic targets. GEPIS takes advantage of large amounts of EST sequence and library information to 
determine gene expression profiles. GEPIS is capable of detenrdning the expression profile of a gene based 
upon its proportional correlation with the number of its occurrences in EST databases, and it works by 
integrating the LIFESEQ® EST relational database and Genentech proprietary information in a stringent and 
statistically meaningful way. In this example, GEPIS is used to identify and cross-validate novel tumor 
antigens, although GEPIS can be configured to perform either very specific analyses or broad screening tasks. 
For the initial screen, GEPIS is used to identify EST sequences from the LIFESEQ® database that correlate 
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